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DESCRIPTION 

Human Proteins Having Hydrophobic Domains and 
DNAs Encoding These Proteins 

5 

TECHNICAL FIELD 

The present invention relates to human proteins 
having hydrophobic domains, DNAs encoding these proteins, 
expression vectors for these DNAs, eukaryotic cells 

10 expressing these DNAs and antibodies directed to these 
proteins. The proteins of the present invention can be 
employed as pharmaceuticals or as antigens for preparing 
antibodies directed to these proteins. The human cDNAs of 
the present invention can be utilized as probes for genetic 

15 diagnosis and gene sources for gene therapy. Furthermore, 
the cDNAs can be utilized as gene sources for producing the 
proteins encoded by these cDNAs in large quantities* Cells 
into which these genes are introduced to express secretory 
proteins or membrane proteins in large quantities can be 

20 utilized for detection of the corresponding receptors or 
ligands, screening of novel small molecule pharmaceuticals 
and the like. The antibodies of the present invention can be 
utilized for the detection, quantification, purification and 
the like of the proteins of the present invention. 



25 
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BACKGROUND ART 

Cells secrete many proteins extracellular ly. These 
secretory proteins play important roles in the proliferation 
control, the differentiation induction, the material 
5 transport, the biophylaxis, and the like of the cells. 
Unlike intracellular proteins, the secretory proteins exert 
their actions outside the cells. Therefore, they can be 
administered in the intracorporeal manner such as the 
injection or the drip, and they possess hidden 

10 potentialities as pharmaceuticals. In fact, a number of 
human secretory proteins such as interferons, interleukins, 
erythropoietin, thrombolytic agents and the like are 
currently employed as pharmaceuticals. In addition, 
secretory proteins other than those described above are 

15 undergoing clinical trials for developing their use as 
pharmaceuticals . It is believed that the human cells produce 
many unknown secretory proteins. Availability of these 
secretory proteins as well as genes encoding them is 
expected to lead to development of novel pharmaceuticals 

20 utilizing them. 

On the other hand, membrane proteins play 
important roles, as signal receptors, ion channels, 
transporters and the like, in the material transport and the 
signal transduction through the cell membrane. Exairqples 

25 thereof include receptors for various cytokines, ion 
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channels for the sodium ion, the potassium ion, the chloride 
ion and the like, transporters for saccharides, amino acids 
and the like. The genes for many of them have already been 
cloned. It has been clarified that abnormalities in these 
5 membrane proteins are involved in a number of previously 
cryptogenic diseases. Therefore, discovery of a new membrane 
protein is expected to lead to elucidation of the causes of 
many diseases, and isolation of new genes encoding the 
membrane proteins has been desired* 

10 Heretofore, due to difficulty in the purification 

from human cells, many of these secretory proteins and 
membrane proteins have been isolated by genetic approaches. 
A general method is the so-called expression cloning method, 
in which a cDNA library is introduced into eukaryotic cells 

15 to express cDNAs, and the cells secreting, or expressing on 
the surface of membrane, the protein having the activity of 
interest are then screened. However, only genes for proteins 
with known functions can be cloned by using this method. 

In general/ a secretory protein or a membrane 

20 protein possesses at least one hydrophobic domain within the 
protein. After synthesis on ribosomes, such domain works as 
a secretory signal or remains in the phospholipid membrane 
to be entrapped in the membrane. Accordingly, if the 
existence of a highly hydrophobic domain is observed in the 

25 amino acid sequence of a protein encoded by a cDNA when the 
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whole base sequence of the full-length cDNA is determined, 

it is considered that the cDNA encodes a secretory protein 
or, a membrane protein • 

5 OBJECTS OF INVENTION 

The main object of the present invention is to 
provide novel human proteins having hydrophobic domains, 
DNZVs coding for these proteins, expression vectors for these 
DNAs, transformed eucaryotic cells that are capable of 
10 expressing these JMUs and antibodies directed to these 
proteins- 

SUMMARY OF INVENTION 

As the result of intensive studies, the present 
inventors have successfully cloned cDNAs encoding proteins 
having hydrophobic domains from the hxrnian full-length cDNA 
bank, thereby completing the present invention. Thus, the 
present invention provides a human protein having 
hydrophobic domain (s), namely a protein comprising any one 
of amino acid sequences selected from the group consisting 
of SEQ ID NOS: 1 to 10, 31 to 40, 61 to 70, 91 to 100 and 
121 to 130. Moreover, the present invention provides a DNA 
encoding said protein, exemplified by a cBNA comprising any 
one of base sequences selected from the group consisting of 
SEQ ID NOS: 11 to 30, 41 to 60, 71 to 90, 101 to 120 and 131 
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15, 



20 



25 



to 150, an expression vector that is capable of expressing 
said DMA. by in vitro translation or in eukaryotic cells, a 
transformed eukaryotic cell that is capable of. expressing 
said DISIA and of producing said protein, and an antibody 
directed to said protein. 

This object as well as other objects and 
advantages ,of the present invention will become apparent to 
those skilled in the art from the following description with 
reference to the accompanying drawings. 

BRIEF DESCRIPTION OF DRAWINGS 

Figure 1 : A figure depicting . the 

hydrophobicity/hydrophilicity profile of the protein 
encoded by clone HP03613. 

Figure 2: A figure depicting the 

hydrophobicity/hydrophilicity profile of the protein 
encoded by clone HP03700, 

Figure 3 : A figure depicting the 

hydrophobicity/hydrophilicity profile of the protein 
encoded by clone HP03935. 

Figure 4: A figure depicting the 

hydrophobicity/hydrophilicity profile of the protein 
encoded by clone HP10755. 

Figure 5: A figure depicting the' 

hydrophobicity/hydrophilicity profile of the protein 
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encoded by clone HP10760, 

Figure 6: A figure depicting the 

hydrophobicity/hydrophilicity profile of the protein 
encoded by clone HPl 07 64. 
5 Figure 7: A figure depicting , the 

hydrophobicity/hydrophilicity profile of the protein 
encoded by clone HPl 07 68. 

Figure 8: A figure depicting the 

hydrophobicity/hydrophilicity profile of the protein 
10 encoded by clone HPl 07 69. 

Figure 9: A figure depicting the 

hydrophobicity/hydrophilicity profile of the protein 
encoded by clone HP10784. 

Figure 10: A figure depicting the 

15 hydrophobicity/hydrophilicity profile of the protein 
encoded by clone HPl 078 6. 

Figure 11: A figure depicting the 

hydrophobicity/hydrophilicity profile of the protein 
encoded by clone HP03727. 
20 Figure 12 :A figure depicting the 

hydrophobicity/hydrophilicity profile of the protein 
encoded by clone HP03801. 

Figure 13: A figure depicting the 

hydrophobicity/hydrophilicity profile of the protein 
25 encoded by clone HP03883. 
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Figure 14: A figure depicting the 

hydrophobicity/hydrophilicity profile of the protein 
encoded by clone HP03913. 

Figure 15: A figure depicting the 

5 hydrophobicity/hydrophilicity profile of the protein 
encoded by clone HP10753. 

Figure 16: A figure depicting the 

hydrophobicity/hydrophilicity profile of the protein 
encoded by clone HP10758, 

10 Figure 17: A figure depicting the 

hydrophobicity/hydrophilicity profile of the protein 
- encoded by clone HP10771. 

Figure 18: A figure depicting the 

hydrophobicity/hydrophilicity profile of the protein 
15 encoded by clone HP10778. 

Figure 19: A figure depicting the 

hydrophobicity/hydrophilicity profile of the protein 
encoded by clone HP10781. 

Figure 20: A figure depicting the 

20 hydrophobicity/hydrophilicity profile of the protein 
encoded by clone HP10785. 

Figure 21: A figure depicting the 

hydrophobicity/hydrophilicity profile of the protein 
encoded by clone HP03878. 

25 Figure 22; A figure depicting the 
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hydrophobicity/hydrophilicity profile of the protein 
encoded by clone HP03884. 

Figure 23: A figure depicting the 

hydrophobicity/hydrophilicity profile of the protein 
5 encoded by clone HP03934. 

Figure 24 : A figure depicting the 

hydrophobicity/hydrophilicity profile of the protein 
encoded by clone HP03949. 

Figure 25: A figure depicting the 

10 hydrophobicity/hydrophilicity profile of the protein 
encoded by clone HP03959. 

.Figure 26: A figure depicting the 

hydrophobicity/hydrophilicity profile of the protein 
encoded by clone HP03983. 

15 Figure 27: A figure depicting the 

hydrophobicity/hydrophilicity profile of the protein 
encoded by clone HP10745. 

Figure 28: A figure depicting the 

hydrophobicity/hydrophilicity profile of the protein 
20 encoded by clone HP10775. 

Figure 29: A figure depicting the 

hydrophobicity/hydrophilicity profile of the protein 
encoded by clone HP10782. 

Figure 30: A figure depicting the 
25 hydrophobicity/hydrophilicity profile of the protein. 
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Figure 31 :A figure 
hydrophobicity/hydrophilicity profile 
encoded by clone HP03977 . 

Figure 32: A figure 
5 hydrophobicity/hydrophilicity profile 
encoded by clone HP10649. 

Figure 33: A figure 
hydrophobicity/hydrophilicity profile 
encoded by clone HP10779. 
10 Figure 34: A . figure 

hydrophobicity/hydrophilicity profile 
encoded by clone HP10790. 

Figxure 35: A figure 
hydrophobicity/hydrophilicity profile 
15 encoded by clone HF10793. 

Figure 36: A figure 
hydrophobicity/hydrophilicity profile 
encoded by clone HP10794. 

Figure 37: A figure 
20 hydrophobicity/hydrophilicity profile 
encoded by clone HP10797. 

Figure 38: A figure 
hydrophobicity/hydrophilicity profile 
■encoded by clone HP10798. 
25 Figure 39: A figure 
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hydrophobicity/hydrophilicity profile 
encoded by clone HP10800. 

Figure 40: A figure 
hydrophobicity/hydrophilicity profile 
5 encoded by clone HP10801- 

Figure 41 :A figure 
hydrophobicity/hydrophilicity profile 
encoded by clone HP03596. 

Figure 42: A figure 
10 hydrophobicity/hydrophilicity profile 
encoded by clone HP03882. 

Figure 43 :A figure 
hydrophobicity/hydrophilicity profile 
encoded by clone HP03903. 
15 Figure 44: A figure 

hydrophobicity/hydrophilicity profile 
encoded by clone HP03974- 

Figxire 45: A figure 
hydrophobicity/hydrophilicity profile 
20 encoded by clone HP03978. 

Figure 46: A figure 
hydrophobicity/hydrophilicity profile 
encoded by clone HP10735. 

Figure 47: A figure 
25 hydrophobicity/hydrophilicity profile 
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encoded by clone HP10750. 

Figure 48: A figure depicting 

hydrophobicity/hydrophilicity profile of the 
encoded by clone HP10777. 
5 Figure 49: A figure depicting 

hydrophobicity/hydrophilicity profile of the 
encoded by clone HP10780. 

Figure 50: A figure depicting 

hydrophobicity/hydrophilicity profile of the 
10 encoded by clone HP10795. 

' DETAILED DESCRIPTION OF THE INVENTION 

The proteins of the present invention can be 
obtained, for example, by a method for isolating proteins 

15 from human organs, cell lilies or the like, a method for 
preparing peptides by the chemical synthesis based on the 
amino acid sequence of the present invention, or a method 
for producing proteins by the recombinant DNA technology 
using the DNAs encoding the hydrophobic domains of the 

20 present invention- Among these, the method for producing 
proteins by the recombinant DNA technology is preferably 
employed. For example, the proteins can be expressed in 
vitro by preparing an RNA by in vitro transcription from a 
vector having the cDNA of the present invention, and then 

25 carrying out in vitro translation using this RNA as a 
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teit^late* Alternatively/ incorporation of the translated 
region into a suitcible expression vector by the method known 
in the art may lead to expression of the encoded protein in 
large quantities in prokaryotic cells such as Escherichia 
5 coli and Bacillus suhtilis, or eukaryotic cells such as 
yeasts, insect cells and mainmalian cells. 

In the case where the protein of the present 
invention is produced by expressing the DNA by in vitro 
translation, the protein of the present invention can be 

10 produced in vitro by incorporating the translated region of 
this cDNA into a vector having an RNA polymerase promoter, 
and then adding the vector to an in vitro translation system 
such as a rabbit reticulocyte lysate. or a wheat germ extract, 
which contains an RNA polymerase corresponding to the 

15 promoter. The RNA polymerase promoters are exemplified by T7, 
T3, SP6 and the like. The vectors containing promoters for 
these RNA polymerases are exemplified by pKAl, pCDM8, pT3/T7 
18, pT7/3 19, pBluescript II and the like. Furthermore, the 
protein of the present invention can be expressed in the 

20 secreted form or the form incoirporated in the microsome 
membrane when a canine pancreas microsome or the like is 
added to the reaction system. 

In the case where the protein of the present 
invention is produced by expressing the DNA in a 

25 microorganism such as Escherichia coli, a recombinant 
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expression vector in which the translated region of the cDNA 
of the present invention is incorporated into an expression 
vector having an origin which is capable of replicating in 
the microorganism, a promoter, a ribosome-binding site, a 
5 cDNA-cloning site, a terminator and the like is constructed. 
After transformation of the host cells with this expression 
vector, the resulting trans formant is cultured. Thus, the 
protein encoded by the cDNA can be produced in large 
quantities in the ndcroorganism. In this case, a protein 

10 fragment containing any translated region can be obtained by 
adding an initiation codon and a termination codon in front 
of and behind the selected translated region and expressing 
the protein. Alternatively, the protein can be expressed as 
a fusion protein with another protein. Only the portion of 

15 the protein encoded by the cDNA can be obtained by cleaving 
this fusion protein with a suitable protease. The expression 
vectors for Escherichia coli are exemplified by the pUC 
series, pBluescript II, the pET expression system, the pGEX 
expression system and the like. 

20 In the case where the protein of the present 

invention is produced by expressing the DNA in eukaryotic 
cells, the protein of the present invention can be produced 
as a secretory protein, or as a membrane protein on the 
surface of cell membrane, by incorporating the translated 

25 region of the cDNA into an expression vector for eukaryotic 
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cells that has a promoter^ a splicing region, a poly (A) 
addition site and the like, and then introducing the vector 
into the eukaryotic cells. The expression vectors are 
exemplified by pKM, pED6dpc2, pCDMB, pSVK3, pMSG, pSVL, 
5 pBK-CHV, pBK-RSV, EBV vectors, pRS, pYES2 and the like. 
Examples of eukaryotic cells to be used in general include 
maiamalian cultured cells such as monkey kidney C0S7 cells 
and Chinese hamster ovary CHO cells, budding yeasts, fission 
yeasts, silkworm cells, and Xenopus oocytes. Any eukaryotic 

10 cells may be used as long as they are capable of expressing 
the proteins, of the present un^!PGnt ion. The expression vector 
can be introduced into the eukaryotic cells by using a 
method known in the art such as the electroporation method, 
the calcium phosphate method, the liposome method and the 

15 DEAE-dextran method. 

After the protein of the present invention is 
expressed in prokaryotic cells or eukaryotic cells, the 
protein of interest can be isolated and purified from the 
culture by a combination of separation procedures known in 

20 the art. Examples of the separation procedures include 
treatment with a denaturing agent such as urea or a 
detergent, sonication, enzymatic digestion, salting-out or 
solvent precipitation, dialysis, centrifugation, 

ultrafiltration, gel filtration, SDS-PAGE, isoelectric 

25 focusing, ion-exchange chromatography, hydrophobic 
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chromatography, affinity chromatography and reverse phase 
chromatography . 

The proteins of the present - invention also include 
peptide fragments (of 5 amino acid residues or more) 
containing any partial amino acid sequences in the amino 
acid sequences represented by SEQ ID NOS: 1 to 10, 31 to 40, 
61 to 70, 91 to 100 and 121 to 130. These peptide fragments 
can be utilized as antigens for preparation of antibodies- 
Among the proteins of the present invention, those having 
the signal sequences are secreted in the form of mature 
proteins after the signal sequences are removed. Therefore, 
these mature proteins shall come within the scope . of the 
protein of the present invention • The N-terminal amino acid 
sequences of the mature proteins can be easily determined by 
using the method for the determination of cleavage site of a 
signal sequence [ 8-187100] . Furthermore, some membrane 

proteins undergo the processing on the cell surface to be 
converted to the secreted forms. Such proteins or peptides 
in the secreted forms shall also come within the scope of 
the protein of the present invention. In the case where 
sugar chain-binding sites are present in the amino acid 
sequences of the proteins, expression of the proteins in 
appropriate eukaryotic cells affords the proteins to which 
sugar chains are added* Accordingly, such proteins or 
peptides to which sugar chains are added shall also come 
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within the scope of the protein of the present invention. 

The DNAs of the present invention include all the 
DNAs encoding the above-mentioned proteins. These DNAs can 
be obtained by using a method for chemical synthesis, a 

.5 method for cDNA cloning and the like. 

The cDNAs of the present invention can be cloned, 
for example, -from cDNA libraries derived from the human 
cells. The cDNZVs are synthesized by using poly (A) ^ mus 
extracted from human cells as templates. The human cells may 

10 be cells delivered from the hman body, for example, by the 
operation or may be the cultured cells. The cDNAs can be 
synthesized by using any method such as the Okayama-Berg 
method [Okayama, H. and Berg, P., Mol. Cell. Biol. 2: 161- 
170 (1982)], the Gubler-Hof fman method [Gubler, U. and 

15 Hoffman, J., Gene 25: 263-269 (1983) ] and the like. However, 
it is desirable to use the capping method [Kato, S, et al.. 
Gene 150: 243-250 (1994)], as exemplified in Examples, in 
order to obtain a full-length clone in an effective manner. 
In addition, commercially available human cDNA libraries can 

20 be utilized. The cDNAs of the present invention can be 
cloned from the cDNA libraries by synthesizing an 
oligonucleotide on the basis of base sequences of any 
portion in the cDNA of the present invention and screening 
the cDm libraries using this oligonucleotide as a probe for 

25 colony or plaque hybridization according to a method loiown 
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in the art. In addition, the cDNA fragments of the present 
invention can be prepared from an iciRNA. isolated from hioman 
cells by the RT-PCR method in which oligonucleotides which 
hybridize with both termini of the cDNA. fragment of interest 
are synthesized, which are then used as the primers • 

The cDNAs of the present invention are 
characterized in that they comprise any one of the base 
sequences represented by SEQ ID NOS: 11 to 20, 41 to 50, 71 
to 80, 101 to 110 and 131 to 140 or the base sequences 
represented by SEQ ID NOS: 21 to 30, 51 to 60, 81 to 90, 111 
to 120 and 141 to 150. Table 1 summarizes the clone number 
(HP number), the cells from which the cDWi clone was 
obtained, the total niamber of bases of the cDNA, and the 
number of the amino acid residues of the encoded protein, 
for each of the cDNZ^. 
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Table 1 



Sequence No. 


HP No- 


Cell 




Number 
. of 
bases 


Number of 
amino 
acids 




11, 


21 


HP03613 


Kidney 






2865 


578 


2. 


12, 


22 


HP03700 


Kidney 






3323 


243 


3, 


13, 


23 


HP03935 


ECidney 






1585 


461 


4, 


14, 


24 


HP10755 


Kidney 






2122 


647 


5, 


15, 


25 


HP10760 


Kidney 






1775 


446 


6, 


16, 


26 


HP10764 


Kidney 






1372 


197 


7r 


17, 


27 


HP10768 


Kidney 






2074 


540 


8, 


18, 


28 


HP10769 


Kidney 






2252 


442 


9, 


19, 


29 


HP10784 


Kidney 






1461 


262 


10, 


20, 


30 


HP10786 


Kidney 






1122 


152 


31, 


41, 


51 


HP03727 


Kidney 






1617 


335 


32, 


42, 


52 


HP03801 


Unbilical 


cord 


blood 


1749 


208 


33, 


43, 


53 


HP03883 


Kidney 






1402 


406 


34, 


44, 


54 


HP03913 


Kidney 






2474 


618 


35, 


45, 


55 


HP10753 


IMoilical 


cord 


blood 


3296 


208 


36, 


46, 


56 


HP10758 


Kidney 






1818 


502 


37, 


47, 


57 


HP10771 


BCidney 






1646 


336 


38, 


48, 


58 


HP10778 


Kidney 






1416 


340 


39, 


49, 


59 


HP10781 


Kidney 






1927 


223 


40, 


50, 


60 


HP10785 


Kidney 






1419 


309 


61, 


71, 


81 


HP03878 


Kidney 






2016 


599 


62, 


72, 


82 


HP03884 


Kidney 






1446 


81 


63, 


73, 


83 


HP03934 


Kidney 






2467 


654 


64, 


74, 


84 


HP03949 


Kidney 






1450 


390 


65, 


75, 


85 


HP03959 


Kidney 






1897 


452 
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Table 1 (continued) 

Sequence No. HP No. Cell Number Number of 

of amino 
^ bases acids 



66, 


76, 


86 


HP03983 


Kidney 




1856 


490 


67, 


77-, 


87 


HP10745 


Umbilical 


cord blood 


2173 


392 


68, 


78, 


88 


HP10775 


Kidney 




1934 


o 


69, 


79, 


89 


HP10782 


Kidney 




1880 


J. w z. 


70, 


80, 


90 


HP10787 


Kidney 








91, 


101, 


111 


HP03977 


BCidney 




1894 




92, 


102, 


112 


HP10649 


KB 




2413 


352 


93, 


103, 


113 


HP10779 


Kidney 




2376 


J — >\j 


94, 


104, 


114 


HP10790 


BCidney 




1155 


330 


95, 


105, 


115 


HP10793 


Kidney 




1329 


350 


96, 


106, 


116 


HP10794 


Kidney 




1387 


113 


97, 


107, 


117 


HP10797 


Kidney 




1158 


189 


98, 


108, 


118 


HP10798 


Kidney 




1106 


277 


99, 


109, 
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The same clones as the cDNZVs of the present 
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invention can be easily obtained by screening the cDNA 
libraries constructed from the hiiman cell lines or human 
tissues utilized in the . present invention using an 
oligonucleotide probe synthesized on the basis of the base 
5 sequence of the cDNA provided in any one of SEQ ID NOS: 11 
to 30, 41 to 60, 71 to 90, 101 to 120 and 131 to 150. 

In general, the polymorphism due to the individual 
differences is frequently observed in human* genes. 
Accordingly, any cDNA in which one or plural nucleotides are 
10 added, deleted and/or substituted with other nucleotides in 
SEQ ID NOS; 11 to 30, 41 to 60, 71 to 90, 101 to 120 and 131 
to 150 shall come within the scope of the present invention. 

Similarly, any protein in which one or plural 
amino acids are added, deleted and/or substituted with other 
15 amino acids resulting from the above-mentioned changes shall 
come within the scope of the present invention, as long as 
the protein possesses the activity of the protein having any 
one of the amino acid sequences represented by SEQ ID NOS: 1 
to 10, 31 to 40, 61 to 70, 91 to 100 and 121 to 130, 
20 The cDNAs of the present invention also include 

cDNA fragments (of 10 bp or more) containing any partial 
base sequence in the base sequences represented by SEQ ID 
NOS: 11 to 20, 41 to 50, 71 to 80, 101 to 110 and 131 to 140 
or in the base sequences represented by SEQ ID NOS: 21 to 30, 
25 51 to 60, 81 to 90, 111 to 120 and 141 to 150. Also, DNA 
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fragments each consisting of a sense strand and an anti- 
sense strand shall come within this scope. These DNA 
fragments can be utilized as the probes for the genetic 
diagnosis* 

5 The antibody of the present invention can be 

obtained from a serum after immunizing an animal using the 
protein of the present invention as an antigen. A peptide 
that is chemically synthesized based on the amino acid 
sequence of the present invention and a protein expressed in 

10 eukaryotic or prokaryotic cells can be used as an antigen. 
Alternatively, an antibody can be prepared by introducing 
the above-mentioned expression vector for, eukaryotic cells 
into the muscle or the skin of an animal by injection or by 
using a gene gun and then collecting a serum therefrom [ JP-A 

15 7-313187] . Animals that can be used include a mouse, a rat, 
a rabbit, a goat, a chicken and the like. A monoclonal 
antibody directed to the protein of the present invention 
can be produced by fusing B cells collected from the spleen 
of the immunized animal with myelomas to generate hybridomas, 

20 In addition to the activities and uses described 

above, the polynucleotides and proteins of the present 
invention may exhibit one or more of the uses or biological 
activities {including those associated with assays cited 
herein) identified below. Uses or activities described for 

25 proteins of the present invention may be provided by 
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administration or use of such proteins or by adiainistration 
or use of polynucleotides encoding such proteins (such as; 
for example, in gene therapies or vectors suitable for 
introduction of DNA) . 

Research Uses and Uti lities 

The polynucleotides provided by " the present 
invention can be used by the research coiraaunity for various 
purposes- The polynucleotides can be used to express 
recombinant protein for analysis, characterization or 
therapeutic use; as markers for tissues in which the 
corresponding protein is preferentially expressed (either 
constitutively or at a . particular stage of tissue 
differentiation or development or in disease states) ; as 
molecular weight markers on Southern gels; as chromosome 
markers or tags (when labeled) to identify chromosomes or to 
map related gene positions; to compare with endogenous DNZV 
sequences in patients to identify potential genetic 
disorders; as probes to hybridize and thus discover novel, 
related DNA sequences; as a source of information to derive 
PGR primers for genetic fingerprinting; as a probe to 
"subtract-out" known sequences in the process of discovering 
other novel polynucleotides; for selecting and making 
oligomers for attachment to a "gene chip" or other support, 
including for examination of expression patterns; to raise 
anti-protein antibodies using DNA immunization techniques; 
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and as an antigen to raise anti-DMA antibodies or elicit 
another immxane response* Where the polynucleotide encodes a 
protein which binds or potentially binds to another protein 
(such as, for example, in a receptor-ligand interaction) , 
5 the polynucleotide can also be Used in interaction trap 
assays (such as, for example, that described in Gyuris et 
al.. Cell 75:791-803 (1993) ) to identify polynucleotides 
encoding the other protein with which binding occurs or to 
identify inhibitors of the binding interaction. 

10 The proteins provided by the present invention can 

similarly be used in assay to determine biological activity, 
including in a panel , of multiple proteins for high- 
throughput screening; to raise antibodies or to elicit 
another immune response; as a reagent (including the labeled 

15 reagent) in assays designed to quantitatively determine 
levels of the protein (or its receptor) in biological 
fluids; as markers for tissues in which the corresponding 
protein is preferentially expressed (either constitutively 
or at a particular stage of tissue differentiation or 

20 development or in a disease state) ; and, of course, to 
isolate correlative receptors or ligands. Where the protein 
binds or potentially binds to another protein (such as, for 
example, in a receptor-ligand interaction) , the protein can 
be used to identify the other protein with which binding 

25 occurs or to identify inhibitors of the binding interaction- 
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Proteins involved in these binding interactions can also be 
used to screen for peptide or small molecule inhibitors or 
agonists of the binding interaction. 

Any or all of these research utilities are capable 
5 of being developed into reagent grade or kit format for 
commercialization as research products* 

Methods for performing the uses listed above are 
well known to those skilled in the art. References 
disclosing such methods include without limitation 

10 "Molecular Cloning: A Laboratory Manual", 2d ed,. Cold 
Spring Harbor Laboratory Press, Sambrook, J,, E.F. Fritsch 
and T. Maniatis eds., 1989, and "Methods in Enzymology: 
Guide to Molecular Cloning Techniques", Academic Press, 
Berger, S.L. and A.R. Kimmel eds., 1987. 

15 Nu tyitio^^l Uses 

Polynucleotides and proteins of the present 
invention can also be used as nutritional sources or 
supplements. Such uses include without limitation use as a 
protein or amino acid supplement, use as a carbon source, 

20 use as a nitrogen source and use as a source of carbohydrate. 
In such cases the protein or polynucleotide of the invention 
can be added to the feed of a particular organism or can be 
administered as a separate solid or liquid preparation, such 
as in the form of powder, pills, solutions, suspensions or 

25 capsules. In the case of microorganisms, the protein or 
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polynucleotide of the invention can be added to the mediim 
in or on which the microorganism is cultured. 

Cytokine ^^nd Cell Proliferation/Differentiation 

Activity 

5 A protein of the present invention may exhibit 

cytokine, cell proliferation (either inducing or inhibiting) 
or cell differentiation (either inducing or inhibiting) 
activity or may induce production of other cytokines in 
certain cell populations. Many protein factors discovered to 

10 date, including all known cytokines, have exhibited activity 
in one or more factor dependent cell proliferation assays, 
and hence the assays serve as a convenient confirmation of 
cytokine activity. The activity of a protein of the present 
invention is evidenced by any one of a nxmiber of routine 

15 factor dependent cell proliferation assays for cell lines 
including, without limitation, 32D, DA2, DAIG, TIO, B9, 
B9/11, BaF3, MC9/G, M+ (preB M+) , 2E8, RB5, DAI, 123, T1165, 
HT2, CTLL2, TF-1, Mo7e and CMK. 

The activity of a protein of the invention may, 

20 among other means, be measured by the following methods: 

Assays for T-cell or thymocyte proliferation 
include without limitation those described in: Current 
Protocols in Immunology, Ed by J. E. Coligan, A.M. Kruisbeek, 
D.H. Margulies, E.M. Shevach, W Strober, Pub. Greene 

25 Publishing Associates and Wiley-Interscience (Chapter 3, In 
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Vitro assays for Mouse Lymphocyte Function 3.1-3.19; Chapter 
1, Immunologic studies in Humans); Takai et al*,. J. Immunol* 
137:3494-3500, 1986; Bertagnolli et al., J, Immunol. 
145:1706-1712, 1990; Bertagnolli et al., Cellular Immunology 
5 133:327-341, 1991; Bertagnolli, et al., J. Immunol. 
149:3778-3783, 1992; Bowman et al., J. Immunol. 152: 1756- 
1761, 1994. 

Assays for cytokine production and/or 
proliferation of spleen cells, lymph node cells or 

10 thymocytes include, without limitation, those described in: 
Polyclonal T cell stimulation, Kruisbeek, A.M. and Shevach, 
E.M. In Current Protocols in Immunology. J.E.e.a. Coligan 
eds. Vol 1 pp. 3.12-1-3.12.14, John Wiley and Sons, Toronto. 
1994; and Measurement of mouse and hiaman Interferon y, 

15 Schreiber, R.D. In Current Protocols in Immunology. J.E.e.a. 
Coligan eds. Vol 1 pp. 6.8.1-6.8.8, John Wiley and Sons, 
Toronto. 1994. 

Assays for proliferation and differentiation of 
hematopoietic and lymphopoietic cells include, without 

20 limitation, those described in: Measurement of Human and 
Murine Interleukin 2 and Interleukin 4, Bottomly, K., Davis, 
L.S. and Lipsky, P.E. In Current Protocols in Immunology. 
J.E.e.a. Coligan eds. Vol 1 pp. 6.3.1-6.3.12, John Wiley and 
Sons, Toronto. 1991; deVries et al., J. Exp. Med. 173:1205- 

25 1211, 1991; Moreau et al.. Nature 336:690-692, 1988; 
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Greenberger et al., Proc. Natl. Acad. Sex. U.S.A. 80:2931- 
2938, 1983; Measurement of mouse and human interleukin 6- 
. Nordan, R. In Current Protocols in Immunology. J.E.e.a. 
Coligan eds. Vol 1 pp. 6.6,1-6.6.5, John Wiley and Sons, 
5 Toronto. 1991; Smith et al., Proc. Natl. Acad. Sci. U.S.A., 
83:1857-1861, 1986; Measurement of hman Interleukin 11 - 
Bennett, F., Giannotti, J., Clark, S.C. and Turner, K. J, In 
Current Protocols in Immunology. J.E.e.a. Coligan eds. Vol 1 
pp. 6.15.1 John Wiley and Sons, Toronto. 1991; Measurement 

10 of mouse and human Interleukin 9 - Ciarletta, A., Giannotti, 
J,, Clark, S.C. and Turner, K*J. In Current Protocols, in 
Immunology. J.E.e.a. Coligan eds. Vol 1 pp. 6.13.1, John 
Wiley and Sons, Toronto. 1991. 

Assays for T-cell clone responses to antigens 

15 (which will identify, among others, proteins that affect 
APC-T cell interactions as well as direct T-cell effects by 
measuring proliferation and cytokine production) include, 
without limitation, those described in: Current Protocols in 
Iinmunology, Ed by J. E. Coligan, A.M. Kruisbeek, D.H. 

20 Margulies, E.M, Shevach, W Strober, Pub. Greene Publishing 
Associates and Wiley-Interscience (Chapter 3, In Vitro 
assays for Mouse Lymphocyte Function; Chapter 6, Cytokines 
and their cellular receptors; Chapter 1, Immunologic studies 
in Humans); Weinberger et al., Proc, Natl. Acad. Sci. USA 

25 77:6091-6095, 1980; Weinberger et al., Eur. J. Immun. 
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11:405-411, 1981; Takai et al., J. Immunol. 137:3494-3500, 
1986; Takai et al., J. Immunol. 140:508-512, 1988. 

Immune Stimulatin g or Sup pressing Acti_vli:y 
A protein of the present invention may also 
5 exhibit immune stimulating or immune suppressing activity, 
including without limitation the activities for which assays 
are described herein. A protein may be useful in the 
treatment of various immxine deficiencies and disorders 
(including severe combined immunodeficiency (SCID) ) , e.g., 

10 in regulating {up or down) growth and proliferation of T 
and/or B lymphocytes, as well as effecting the cytolytic 
activity of NK cells and other cell populations. These 
immune deficiencies may be genetic or be caused by viral 
(e.g., HIV) as well as bacterial or fungal infections, or 

15 may result from autoimmune disorders. More specifically, 
infectious diseases causes by viral, bacterial, fungal or 
other infection may be treatable using a protein of the 
present invention, including infections by HIV, hepatitis 
viruses, herpesviruses, mycobacteria, Leishmania spp., 

20 malaria spp. and various fungal infections such as 
candidiasis. Of course, in this regard, a protein of the 
present invention may also be useful where a boost to the 
immune system generally may be desirable, i,e., in the 
treatment of cancer. 

25 Autoimmune disorders which may be treated using a 
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protein of the present invention include, for example, 
connective tissue disease, multiple sclerosis, systemic 
lupus erythematosus, rheumatoid arthritis, autoimmune 
pulmonary inflammation, Guillain^Barre syndrome, autoimmune 
5 thyroiditis, insulin dependent diabetes .mellitis, myasthenia 
gravis, graft-versus-host disease and autoimmune 
inflammatory eye disease. Such a protein of the present 
invention may also to be useful in the treatment of allergic 
reactions and conditions, such as asthma (particularly 

10 allergic asthma) or other respiratory problems. Other 
conditions, in which immune suppression is desired 
(including, for example, organ transplantation) , may also be 
treatable using a protein of the present invention. 

Using the proteins of the invention it may also be 

15 possible to immune responses, in a number of ways. Down 
regulation may be in the form of inhibiting or blocking an 
immune response already in progress or may involve 
preventing the induction of an immune response. The 
functions of activated T cells may be inhibited by 

20 suppressing T cell responses or by inducing specific 
tolerance in T cells, or both. Immunosuppression of T cell 
responses is generally an active, non-antigen- specif ic, 
process which requires continuous exposure of the T cells to 
the suppressive agent. Tolerance, which involves inducing 

25 non-responsiveness or anergy in T cells, is distinguishable 
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from immunosuppression in that it is generally antigen- 
specific and persists after exposure to the tolerizing agent 
has ceased. Operationally, tolerance can be demonstrated by 
the lack of a T cell response upon reexposure to specific 
antigen in the absence of the tolerizing agent. 

Down regulating or preventing one or more antigen 
functions (including without limitation B lymphocyte antigen 
functions (such as , for example, B7)), e.g., preventing 
high level lymphokine synthesis by activated T cells, will 
be useful in situations of tissue, skin and organ 
transplantation and in graf t-versus-host disease (GVHD) • For 
example, blockage of T cell fimction should result in 
reduced tissue destruction in tissue transplantation. 
Typically, in tissue transplants, rejection of the 
transplant is initiated through its recognition as foreign 
by T cells, followed by an immune reaction that destroys the 
transplant. The administration of a molecule which inhibits 
or blocks interaction of a B7 lymphocyte antigen with its 
natural ligand(s) on immune cells (such as a soluble, 
monomeric form of a peptide having B7-2 activity alone or in 
conjunction with a monomeric form of a peptide having an 
activity of another B lymphocyte antigen (e.g^, B7-1, B7-3) 
or blocking antibody), prior to transplantation can lead to 
the binding of the molecule to the natural ligand{s) on the 
immune cells without transmitting the corresponding 
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costimulatory signal. Blocking B lymphocyte antigen function 
in this matter prevents cytokine synthesis by immune cells, 
such as T cells, and thus acts as an immunosuppressant. 
Moreover, the lack of costimulation may also be sufficient 

5 to anergize the T cells, thereby inducing tolerance in a 
subject. Induction of long-term tolerance by B lymphocyte 
antigen-blocking reagents may avoid the necessity of 
repeated administration of these blocking reagents. To 
achieve sufficient immunosuppression or tolerance in a 

10 subject, it may also be necessary to block the function of a 
combination of B lymphocyte antigens. 

The efficacy of particular blocking reagents in 
preventing organ transplant rejection or GVHD can be 
assessed using animal models that are predictive of efficacy 

15 in humans. Examples of appropriate systems which can be used 
include allogeneic cardiac grafts in rats and xenogeneic 
pancreatic islet cell grafts in mice, both of which have 
been used to examine the immunosuppressive effects of 
CTLA.4Ig fusion proteins in vivo as described in Lenschow et 

20 al.. Science 257:789-792 (1992) and Turka et al., Proc. Natl. 
Acad. Sci USA, 89:11102-11105 (1992). In addition, murine 
models of GVHD (see Paul ed.. Fundamental Immunology, Raven 
Press, New York, 1989, pp. 846-847) can be used to determine 
the effect of blocking B lymphocyte antigen function in vivo 

25 on the development of that disease. 
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Blocking antigen fimction may also be 
therapeutically useful for treating autoimmune diseases. 
Many autoimmune disorders are the result of inappropriate 
activation of T cells that are reactive against self tissue 
5 and which promote the production of cytokines and 
autoantibodies involved in the pathology of the diseases. 
Preventing the activation of autoreactive T cells may reduce 
or eliiainate disease symptoms. Administration of reagents 
which block costimulation of T cells by disrupting 

10 receptor: ligand interactions of B lymphocyte antigens can be 
used to inhibit T cell activation and prevent production of 
autoantibodies or T cell-derived cytokines which may be 
involved in the disease process. Additionally, blocking 
reagents may induce antigen-specific tolerance of 

15 autoreactive T cells which could lead to long-term relief 
from the disease. The efficacy of blocking reagents in 
preventing or alleviating autoimmune disorders can be 
detemiined using a number of well -characterized animal 
models of human autoimmune diseases. Examples include murine 

20 experimental autoimmune encephalitis, systemic lupus 
erythmatosis in MRL/lpr/lpr mice or NZB hybrid mice, murine 
autoimmxme collagen arthritis, diabetes mellitus in NOD mice 
and BB rats, and murine experimental myasthenia gravis (see 
Paul ed.. Fundamental Immunology, Raven Press, New York, 

25 1989, pp. 840-856) . 
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Upregulation of an antigen function (preferably a 
B lymphocyte antigen function) , as a means of up regulating 
immune responses, may also be useful in therapy. 
Upregulation of immune responses may be in the form of 
5 enhancing an existing immune response or eliciting an 
initial immune response- For example, enhancing an immune 
response through stimulating B lymphocyte antigen fimction 
may be useful in cases of viral infection. In addition, 
systemic viral diseases such as influenza, the common cold, 

10 and encephalitis might be alleviated by the administration 
of stimulatory forms of B lymphocyte antigens systemically. 

Alternatively, anti-viral immune responses may. be 
enhanced in an infected patient by removing T cells from the 
patient, costimulating the T cells in vitro with viral 

15 antigen-pulsed APCs either expressing a peptide of the 
present invention or together with a stimulatory form of a 
soluble peptide of the present invention and reintroducing 
the in vitro activated T cells into the patient • Another 
method of enhancing anti-viral immune responses would be to 

20 isolate infected cells from a patient, transfect them with a 
nucleic acid encoding a protein of the present invention as 
described herein such that the cells express all or a 
portion of the protein on their surface, and reintroduce the 
transfected cells into the patient. The infected cells would 

25 now be capable of delivering a costimulatory signal to, and 
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thereby activate, T cells in vivo. 

In another application, up regulation or 
enhancement of antigen function (preferably B lymphocyte 
antigen function) may be useful in the induction of tumor 
^ 5 immunity. Tumor cells (e.g., sarcoma, melanoma, lymphoma, 
leukemia, n&5xoblastoma,_^ carcinoma) transf ected with a 
nucleic acid encoding at least one peptide of the present 
invention can be administered to ^ a subject to overcome 
tumor-specific tolerance in the subject. If desired, the 
10 timaor cell can be transf ected to express a combination of 
peptides. For example, tumor cells obtained from a patient 
can be transf ected ex vivo with an expression vector 
directing the expression of a peptide having B7-2--like 
activity alone, or in conjunction with a peptide having 37- 

15 l~like activity and/or B7-3-like activity. The transfected 

( 

tumor cells are returned to the patient to result in 
expression of the peptides on the surface of the transfected 
cell* Alternatively/ gene therapy techniques can be used to 
target a tumor cell for transf ection in vivo. 

20 The presence of the peptide of the present 

invention having the activity of a B lymphocyte antigen (s) 
on the surface of the tumor cell provides the necessary 
costimulation signal to T cells to induce a T cell mediated 
iiomune response against the transfected tumor cells. In 

25 addition, tumor cells which lack MHC class I or MHC class II 
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molecules, or which fail to reexpress sufficient amounts of 
MHC class I or MHC class II molecules, can be transfected 
with nucleic acid encoding all or a portion of (e.g., a 
cytoplasmic-domain truncated portion) of an MHC class I a 
5 chain protein and 13 ^ microglobulin protein or an MHC class 
II a chain protein and an MHC cIblSS II J3 chain protein to 
thereby express MHC class I or MHC class II proteins on the 
cell surface- Expression of the appropriate class I or class 
II MHC in conjunction with a peptide having the activity of 

10 a B lymphocyte antigen {e.g., B7-1, B7-2, B7-3) induces a T 
cell mediated immune response against the transfected tumor 
cell. Optionally, a gene encoding an antisense construct 
which blocks expression of an MHC class II associated 
protein, such as the invariant chain, can also be 

15 cotransfected with a DMA encoding a peptide having the 
activity of a B lymphocyte antigen to promote presentation 
of tumor associated antigens and induce tvimor specific 
immunity. Thus, the induction of a T cell mediated immune 
response in a human subject may be sufficient to overcome 

20 tumor-specific tolerance in the subject* 

The activity of a protein of the invention may, 
among other means, be measured by the following methods: 

Suitable assays for thymocyte or splenocyte 
cytotoxicity include, without limitation, those described 

25 in: Current Protocols in Immxanology, Ed by J. E, Coligan, 
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A.M. Kruisbeek, D.H. Margulies, E.M. Shevach, W Strober, Pub. 
Greene Publishing Associates and Wiley-Interscience {Chapter 
3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; 
Chapter 7, Iimavmologic studies in Humans); Herrmann et al., 

5 Proc, Natl. Acad. Sci. USA 78:2488-2492, 1981; Herrmann et 
al., J. Immunol. 128:1968-1974, 1982; Handa et al., J. 
Immunol. 135:1564-1572, 1985; Takai et al., J. Immunol. 
137:3494-3500, 1986; Takai et al., J. Iiraaunol. 140:508-512, 
1988; Herrraann et al., Proc. Natl. Acad. Sci. USA 78:2488- 

10 2492, 1981; Herrmann et al., J. Immunol. 128:1968-1974, 
1982; Handa et al., J. Immunol. 135:1564-1572, 1985; • Takai 
' et al., J. Immunol. 137:3494-3500, 1986; Bowmanet al., J. 
Virology 61:1992-1998; Takai et al., J. Immunol. 140:508-512, 
1988; Bertagnolli et al.. Cellular Immunology 133:327-341, 

15 1991; Brown et al., J. Imouunol. 153:3079-3092, 1994. 

Assays for T-cell-dependent immunoglobulin 
responses and isotype switching (which will identify, among 
others, proteins that modulate T-cell dependent antibody 
responses and that affect Thl/Th2 profiles) include, without 

20 limitation, those described in: Maliszewski, J. Immunol. 
144:3028-3033, 1990; and Assays for B cell function: In 
vitro antibody production, Mond, J.J. and Brunswick, M. In 
Current Protocols in Immunology. J.E.e.a. Coligan eds. Vol 1 
pp. 3.8.1-3.8.16, John Wiley and Sons, Toronto. 1994. 

25 Mixed lymphocyte reaction (MLR) assays (which will 
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identify, among others, proteins that generate predominantly 
Thl and CTL responses) include/ without limitation, those 
described in: Current Protocols in Immunology, Ed by J. E. 
Coligan, A.M. Kruisbeek, D.H. Margulies, E.M. Shevach, W 
5 Strober, Pub. Greene Publishing Associates and Wiley- 
Interscience (Chapter 3, In Vitro assays for Mouse 
Lymphocyte Function 3.1-3.19; Chapter 7, Immunologic studies 
in Humans); Takai et al,, J. Immunol. 137:3494-3500, 1986; 
Takai et al., J. Immunol. 140:508-512, 1988; Bertagnolli et 
. 10 al,, J. Immunol. 149:3778-3783, 1992. 

Dendritic cell-dependent assays (which will 
identify, among others, proteins expressed by dendritic 
cells that activate naive T-cells) include, without 
limitation, those described in: Guery et al., J. Immunol. 
15 134:536-544, 1995; Inaba et al.. Journal of Experimental 
Medicine 173:549-559, 1991; Macatonia et al.. Journal of 
Immunology 154:5071-5079, 1995; Porgador et al.. Journal of 
Experimental Medicine 182:255-260, 1995; Nair et al.. 
Journal of Virology 67:4062-4069, 1993; Huang et al., 
20 Science 264:961-965, 1994; Macatonia et al.. Journal of 
Experimental Medicine 169:1255-1264/ 1989; Bhardwaj et al.. 
Journal of Clinical Investigation 94:797-807, 1994; and 
Inaba et al.. Journal of Experimental Medicine 172:631-640, 
1990. 

25 Assays for lymphocyte survival/apoptosis (which 
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will identify, among others, proteins that prevent apoptosis 
after superantlgen induction and proteins that regulate 
lyit^hocyte homeostasis) include, without limitation, those 
described in: Darzynkiewicz et al.. Cytometry 13:795-808, 
5 1992; Gorczyca et al*. Leukemia 7:659-670, 1993; Gorczyca et 
al., Cancer Research 53:1945-1951, 1993; Itoh et al.. Cell 
66:233-243, 1991; Zacharchuk, Journal of Immunology 
145:4037-4045, 1990; Zamai et al;. Cytometry 14:891-897, 
1993; Gorczyca et al.. International Journal of Oncology 

10 1:639-648, 1992. 

Assays for proteins that influence early steps of 
T-cell commitment and development include, without 
limitation, those described in: Antica et al., Blood 84: Hi- 
ll?, 1994; Fine et al.. Cellular Immunology 155:111-122, 

15 1994; Galy et al.. Blood 85:2770-2778, 1995; Toki et al., 
Proc. Nat, Acad Sci. USA 88:7548-7551, 1991. 

Hematopoi esis Re gulating Activity 

A protein of the present invention may be useful 
in regulation of hematopoiesis and, consequently, in the 

20 treatment of myeloid or lymphoid cell deficiencies. Even 
marginal biological activity in support of colony forming 
cells or of factor-dependent cell lines indicates 
involvement in regulating hematopoiesis, e.g. in supporting 
the growth and proliferation of erythroid progenitor cells 

25 alone or in combination with other cytokines, thereby 
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indicating utility, for example, in treating various anemias 
or for use in conjunction with irradiation/ chemotherapy to 
stimulate the production of erythfoid precursors and/or 
erythroid cells; in supporting the growth and proliferation 
5 of myeloid cells such as granulocytes and 
monocytes/macrophages (i.e., traditional CSF activity) 
useful, for example, in conjunction with chemotherapy to 
prevent or treat consequent myelo-suppression; in supporting 
the growth and proliferation of megakaryocytes and 

10 consequently of platelets thereby allowing prevention or 
treatment of various platelet disorders such as 
thrombocytopenia, and generally for use in place of or 
complementary to platelet transfusions; and/or in supporting 
the growth and proliferation of hematopoietic stem cells 

15 which are capable of maturing to any and all of the above- 
mentioned hematopoietic cells and therefore find therapeutic 
utility in various stem cell disorders (such as those 
usually treated with transplantation, including, without 
limitation, aplastic anemia ' and paroxysmal nocturnal 

20 hemoglobinuria), as well as in repopulating the stem cell 
compartment post irradiation/chemotherapy, either in-vivo or 
ex-vivo (i.e., in conjunction with bone marrow 
transplantation or with peripheral progenitor cell 
transplantation (homologous or heterologous)) as normal 

25 cells or genetically manipulated for gene therapy. 
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The activity of a protein of the invention may, 
among other means, be measured by the following methods: 

Suitable assays for proliferation and 
differentiation of various hematopoietic lines are cited 
5 above . 

Assays for embryonic stem cell differentiation 
(which will identify, among others, proteins that influence 
embryonic differentiation hematopoiesis) include, without 
limitation, those described in: Johansson et al. Cellular 

10 Biology 15:141-151, 1995; Keller et al.. Molecular and 
Cellular Biology 13:473-486, 1993; McClanahan et al.. Blood 
81:2903-2915, 1993. 

Assays for stem cell survival and differentiation 
(which will identify, among others, proteins that regulate 

15 lympho-hematopoiesis) include, ' without limitation, those 
described in: Methylcellulose colony forming assays, 
Freshney, M.G. In Culture of Hematopoietic Cells. R.I. 
Freshney, et al. eds. Vol pp. 265-268, Wiley-Liss, Inc., New 
York, NY. 1994; Hirayaiaa et al., Proc. Natl. Acad. Sci. USA 

20 89:5907-5911, 1992; Primitive hematopoietic colony forming 
cells with high proliferative potential, McNiece, I.K. and 
Briddell, R.A. In Culture of Hematopoietic Cells. R.I. 
Freshney, et al. eds. Vol pp. 23-39, Wiley-Liss, Inc., New 
York, NY. 1994; Neben et al.. Experimental Hematology 

25 22:353-359, 1994; Cobblestone area forming cell assay, 
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Ploemacher, R*E- In Culture of Hematopoietic Cells. R.I, 
Freshney, et al. eds. Vol pp. 1-21, Wiley-Liss, Inc., New 
York, NY. 1994; Long term bone marrow cultures in the 
presence of stromal cells, Spooncer, E., Dexter, M. and 
Allen, T. In Culture of Hematopoietic Cells. R.I, Freshney, 
et al. eds. Vol pp. 163-179, Wiley-Liss, Inc., New York, NY. 
1994; Long term culture initiating cell assay, Sutherland, 
H.J. In Culture of Hematopoietic Cells. R.I. Freshney, et al. 
eds. Vol pp. 139-162, Wiley-Liss, Inc., New York, NY. 1994. 
Ti.<i.qne Growth Activity 

A protein of the present invention also may have 
utility in compositions used for bone, cartilage, tendon, 
ligament and/or nerve tissue growth or regeneration, as well 
as for wound healing and tissue repair and replacement, and 
in the treatment of burns, incisions and ulcers. 

A protein of the present invention, which induces 
cartilage and/or bone growth in circumstances where bone is 
not normally formed, has application in the healing of bone 
fractures and cartilage damage or defects in humans and 
other animals. Such a preparation employing a protein of the 
invention may have prophylactic use in closed as well as 
open fracture reduction and also in the ^improved fixation of 
artificial joints. De novo bone formation induced by an 
osteogenic agent contributes to the repair of congenital, 
trauma induced, or oncologic resection induced craniofacial 
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defects, and also is useful in cosmetic plastic surgery. 

A protein of this invention may also be used iri 
the tjreatment of periodontal disease, and in other tooth 
repair processes. Such agents may provide an environment to 
5 attract bone-forming cells, stimulate growth of bone-foming 
cells or induce differentiation of progenitors of bone- 
foinaing cells. A protein of the invention may also be useful 
in the treatment of osteoporosis or osteoarthritis, such as 
through stimulation of bone and/or cartilage repair or by 

10 blocking Inflammation or processes of tissue destruction 
{collagenase activity, osteoclast activity, etc.) mediated 
by inflammatory processes. 

Another category of tissue regeneration activity 
that may be attributable to the protein of the present 

15 invention is tendon/ ligament formation, A protein of the 
present invention, which induces tendon/ligament-li3ce tissue 
or other tissue formation in circumstances where such tissue 
is not normally formed, has application in the healing of 
tendon or ligament tears, deformities and other tendon or 

20 ligament defects in humans and other animals. Such a 
preparation employing a tendon/ligament-like tissue inducing 
protein may have prophylactic use in preventing damage to 
tendon or ligament tissue, as well as use in the improved 
fixation of tendon or ligament to bone or other tissues, and 

25 in repairing defects to tendon or ligament tissue. De novo 
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tendon/ligament-like tissue formation induced by a 
composition of the present invention contributes to the 
repair of congenital, trauma induced/ or other tendon or 
ligament defects of other origin, and is also useful in 
5 cosmetic plastic surgery for attacliment or repair of tendons 
or ligaments. The compositions of the present invention may 
provide an environment to attract tendon or ligament-forming 
cells, stimulate growth of tendon- or ligament- forming cells, 
induce differentiation of progenitors of tendon- or 

10 ligament- foming cells, or induce growth of tendon/ ligament 
cells or progenitors ex vivo for return in vivo to effect 
tissue repair. The compositions of the invention may also be 
useful in the treatment of tendinitis, carpal tunnel 
syndrome and other tendon or ligament defects. The 

15 compositions may also include an appropriate matrix and/or 
sequestering agent as a carrier as is well known in the art. 

The protein of the present invention may also be 
useful for proliferation of neural cells and for 
regeneration of nerve and brain tissue, i,e. for the 

20 treatment of central and peripheral nervous system diseases 
and neuropathies, as well as mechanical and traumatic 
disorders, which involve degeneration, death or trauma to 
neural cells or nerve tissue. More specifically, a protein 
may be used in the treatment of diseases of the peripheral 

25 nervous system, such as peripheral nerve injuries. 
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peripheral neuropathy and localized neuropathies, and 
central nervous system diseases, such as Alzheimer's, 
Parkinson's disease, Huntington's . disease, amyotrophic 
lateral sclerosis, and Shy-Drager syndrome. Further 
5 conditions which may be treated in accordance with the 
present invention include mechanical and traumatic disorders, 
such as spinal cord disorders, head trauma cind 
cerebrovascular diseases such as stroke. Peripheral 
neuropathies resulting from chemotherapy or other medical 
10 therapies may also be treatable using a protein of the 
invention . 

Proteins of the invention may also be useful to 
promote better or faster closure of non-healing wounds, 
including without limitation pressure ulcers, ulcers 

15 associated with vascular insufficiency, surgical and 
traumatic wounds and the like- 

It is expected that a protein of the present 
invention may also exhibit activity for generation or 
regeneration of other tissues, such as orgcins (including, 

20 for exair^^l^/ pancreas, liver, intestine, kidney, skin, 
endothelium) , muscle (smooth, skeletal or cardiac) and 
vascular (including vascular endothelium) tissue, or for 
promoting the growth of cells comprising such tissues- Part 
of the desired effects may be by inhibition or modulation of 

25 fibrotic scarring to allow normal tissue to regenerate. A 
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protein of the invention may also exhibit angiogenic 
activity. 

A protein of the present invention may also be 
useful for gut protection or regeneration and treatment of 
5 lung or liver fibrosis^ reperfusion injury in various 
tissues, and conditions resulting from systemic cytokine 
damage . 

A protein of the present invention may also be 
useful for promoting or inhibiting differentiation of 

10 tissues described above from precursor tissues or cells; or 
for inhibiting the grovdih of tissues described above. 

The activity of a protein df the invention may, 
among other means, be measured by the following methods: 

Assays for tissue generation activity include, 

15 without limitation, those described in: International Patent 
Publication No. W095/16035 (bone, cartilage, tendon); 
International Patent Publication No. WO95/05846 (nerve, 
neuronal); International Patent Publication No. WO91/07491 
(skin, endotheliTim ) . 

20 Assays for wound healing activity include, without 

limitation, those described in: Winter, Epidermal Wound 
Healing, pps, 71-112 (Maibach, HI and Rovee, DT, eds.). Year 
Book Medical Publishers, Inc., Chicago, as modified by 
Eaglstein and Mertz, J. Invest. Dermatol 71:382-84 (1978). 

25 Activin/Ii^bibin Activity 



wo 01/49728 



PCT/JPOO/09359 



A protein of the present invention may also 
exhibit activin- or inhibin-related activities. Inhibins are 
characterized by their ability to inhibit the release of 
follicle stimulating hormone (FSH) , while activins and are 
5 characterized by their ability to stimulate the release of 
follicle stimulating hormone (FSH) . Thus, a protein of the 
present invention, alone or in heterodimers with a member of 
the inhibin a family, may be useful as a contraceptive based 
on the ability of inhibins to decrease fertility in female 

10 mammals and decrease spermatogenesis in laale mammals. 
Administration of sufficient amounts of other inhibins can 
induce infertility in these mammals. Alternatively, the 
protein of the invention, as a homodimer or as a heterodimer 
with other protein subunits of the inhibin- |3 group, may be 

15 useful as a fertility inducing therapeutic, based upon the 
ability of activin molecules in stimulating FSH release from 
cells of the anterior pituitary. See, for example. United 
States Patent 4,798,885. A protein of the invention may also 
be useful for advancement of the onset of fertility in 

20 sexually immature mammals, so as to increase the lifetime 
reproductive performance of domestic animals such as cows, 
sheep and pigs. 

The activity of a protein of the invention may, 
among other means, be measured by the following methods: 

25 Assays for activin/inhibin activity include. 
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without limitation, those described in: Vale et al., 
Endocrinology 91:562-572, 1972; Ling et al.. Nature 321:779- 
782, 1986; Vale et al*. Nature 321:776-779, 1986; Mason et 
al.. Nature 318:659-663^ 1985; Forage et al., Proc, Natl- 
5 Acad. Sci. USA 83:3091-3095, 1986. 

Chejnotac tic/Chemokinetic A ctivity 

A protein of the present invention may have 
chemotactic or chemokinetic activity (e»g*, act as a 
chemokine) for mainiaalian cells, including, for example, 

10 monocytes, fibroblasts, neutrophils, T-cells, mast cells, 
eosinophils, epithelial and/or endothelial cells. 
Chemotactic and chemokinetic proteins can be used to 
mobilize or attract a desired cell population to a desired 
site of action. Chemotactic or chemokinetic proteins provide 

15 particular advantages in treatment of wounds and other 
trauma to tissues, as well as in treatment of localized 
infections. For example, attraction of lymphocytes, 
monocytes or neutrophils to tumors or sites of infection may 
result in improved immune responses against the tumor or 

20 infecting agent. 

A protein or peptide has chemotactic activity for 
a particular cell population if it can stimulate, directly 
or indirectly, the directed orientation or movement of such 
cell population. Preferably, the protein or peptide has the 

25 ability to directly stimulate directed movement of cells. 
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P?hether a particular protein has chemotactic activity for a 
population of cells can be readily determined by employing 
such protein or peptide in any known assay for cell 
chemo taxis, 

5 The activity of a protein of the invention may, 

among other means, be measured by the following methods: 

Assays for chemotactic activity (which will 
identify proteins that induce or prevent chemotaxis) consist 
of assays that measure the ability of a protein to induce 

10 the migration of cells across a membrane as well as the 
ability of a protein to induce the adhesion of one cell 
population to another cell population. Suitable assays for 
movement and adhesion include, without limitation, those 
described in: Current Protocols in Immunology, Ed by J.E. 

15 Coligan, A.M. Kruisbeek, D-H. Margulies, E*M. Shevach, 
W.Strober^ Pub, Greene Publishing Associates and Wiley- 
Interscience (Chapter 6.12, Measurement of alpha and beta 
Chemokines 6.12.1-6.12.28; Taub et al. J. Clin. Invest. 
95:1370-1376, 1995; Lind et al. AFMIS 103:140-146, 1995; 

20 Muller et al Eur. J. Immunol. 25: 1744-1748; Gruber et al. J. 
of Immunol. 152:5860-5867, 1994; Johnston et al, J. of 
Immunol. 153: 1762-1768, 1994. 

Hemostatir! and Throitvbolyt ic Activity 

A protein of the invention may also, exhibit 

25 hemostatic or thrombolytic activity. As a result, such a 
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protein is expected to be useful in treatment of various 
coagulation disorders (including hereditary disorders, such 
as hemophilias) or to enhance coagulation and other 
hemostatic events in treating wounds resulting from trauma, 
5 surgery or other causes. A protein of the invention may also 
be useful for dissolving or inhibiting formation of 
- thromboses and for treatment and prevention of conditions 
resulting therefrom (such as, for exairqple, infarction of 
cardiac and central nervous system vessels (e.g., stroke)). 

10 The activity of a protein of the invention may, 

among other means, he measured by the following methods; 

Assay for hemostatic and thrombolytic activity 
include, without limitation, those described in: Linet et 
al., J. Clin- Pharmacol. 26:131-140, 1986; Burdick et al., 

15 Thrombosis Res. 45:413-419, 1987; Himphrey et al.. 
Fibrinolysis 5:71-79 (1991); Schaub, Prostaglandins 35:467- 
474, 1988. 

Rece ptor /Ligand Activity 

A protein of the present invention may also 
20 demonstrate activity as receptors, receptor ligands or 
inhibitors or agonists of receptor/ligand interactions. 
Examples of such receptors and ligands include, without 
limitation, cytokine receptors and their ligands, receptor 
kinases and their ligands, receptor phosphatases and their 
25 ligands, receptors involved in cell-cell interactions and 
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their ligands (including without limitationy. cellular 
adhesion molecules (such as select ins, integrins and their 
ligands) and receptor /ligand pairs involved in antigen 
presentation, antigen recognition and development of 
5 cellular and humoral immune responses) , Receptors and 
ligands are also useful for screening of potential peptide 
or sitiall molecule inhibitors of the relevant receptor /ligand 
interaction. A protein of the present invention (including, 
without limitation, fragments of receptors and ligands) may 

10 themselves be useful as inhibitors of receptor/ligand 
interactions - 

The activity of a protein of the invention may, 
among other means, be measured by the following methods: 

Suitable assays for receptor-ligand activity 

15 include without limitation those described in: Current 
Protocols in Immunology, Ed by J,E. Coligan, A.M. Kruisbeek, 
D.H. Margulies, E.M. Shevach, W.Strober, Pub. Greene 
Publishing Associates and Wiley-Interscience (Chapter 7,28, 
Measurement of Cellular Adhesion under static conditions 

20 7.28.1-7.28.22), Takai et al., Proc. Natl. Acad. Sci. USA 
84:6864-6868, 1987; Bierer et al., J. Exp. Med. 168:1145- 
1156, 1988; Rosenstein et al., J- Exp. Med. 169:149-160 
1989; Stoltenborg et al., J. Immunol. Methods 175:59-68, 
1994; Stitt et al., Cell 80:661-670, 1995. 

25 ^ ti- lxxfld m^^toxY ActAvit^ 
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Proteins of the present invention may also exhibit 
anti-inflainmatory activity. The anti-inf lairaoatory activity 
may be achieved by providing a stimulus to cells involved in 
the inflammatory response, by inhibiting or promoting cell- 
cell interactions (such as, for example, cell adhesion) , by 
inhibiting or promoting chemotaxio of cells involved in the 
inflammatory ^ process^ inhibiting or promoting cell 
extravasation, or by stimulating or suppressing production 
of other factors which more directly inhibit or promote an 
inflammatory response. Proteins exhibiting such activities 
can be used to treat inflammatory conditions including 
chronic or acute conditions) , including without limitation 
inflammation associated with infection (such as septic shock, 
sepsis or systemic inflammatory response syndrome (SIRS) ) , 
ischemia-^reperfusion injury, endotoxin lethality, arthritis, 
complement-mediated hyperacute rejection, nephritis, 
cytokine or chemokine-induced lung injury, inflammatory 
bowel disease, Crohn's disease or resulting from over 
production of cytokines such as TNF or IL-1. Proteins of the 
invention may also be useful to treat anaphylaxis and 
hypersensitivity to an antigenic substance or material. 

Tumnr Inhibition Act lvj-tv 

In addition to the activities described above for 
immunological treatment or prevention of tumors, a protein 
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of the invention may exhibit other anti-tumor activities. A 
protein may inhibit tumor growth directly or indirectly 
(such as, for example, via ADCC) . A protein may exhibit its 
tumor inhibitory activity by acting on tumor tissue or tumor 
5 precursor tissue, by inhibiting formation of tissues 
necessary to support tumor growth (such as, for example, by 
inhibiting angiogenesis) , by causing production of other 
factors, agents or cell types which inhibit tiomor growth, or 
by suppressing, eliminating or inhibiting factors, agents or 
10 cell types which promote tumor growth. 

Other Activities 

A protein of the invention may also exhibit one or 
more of the following additional activities or effects: 
inhibiting the growth, infection or fimction of, or killing, 

15 infectious agents, including, without limitation, bacteria, 
viruses, fungi and other parasites; effecting (suppressing 
or enhancing) bodily characteristics, including, without 
limitation, height, weight, hair color, eye color, skin, fat 
to lean ratio or other tissue pigmentation, or organ or body 

20 part size or shape (such as, for example, breast 
augmentation or diminution, change in bone form or shape) ; 
effecting biorhythms or cardiac cycles or rhythms; effecting 
the fertility of male or feiaale subjects; effecting the 
metabolism, catabolism, anabolism, processing, utilization. 
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storage or elimination of dietary fat, lipid, protein, 
carbohydrate, vitamins, lainerals, cof actors or other 
nutritional factors or comjponeht (s) ; effecting behavioral 
characteristics, including, without limitation, appetite, 
5 libidO/ stress, cognition (including cognitive disorders), 
depression (including depressive disorders) and violent 
behaviors; providing analgesic effects or other pain 
reducing effects; promoting differentiation and growth of 
embryonic stem cells in lineages other than hematopoietic 

10 lineages; hormonal or endocrine activity; in the case of 
enzymes, correcting deficiencies of the enzyme and treating 
deficiency-related diseases; treatment of hyperproliferative 
disorders (such as, for example, psoriasis) ; immunoglobulin- 
like activity (such as, for example, the ability to bind 

15 antigens or complement) ; and the ability to act as an 
antigen in a vaccine composition to raise an immune response 
against such protein or another material or entity which is 
cross-reactive with such protein. 

Examples 

20 The present invention is specifically illustrated 

in more detail by the following Examples, but Examples are 
not intended to restrict the present invention. The basic 
procedures with regard to the recombinant DNA. and the 
enzymatic reactions were carried out according to the 



wo 01/49728 



PCT/JPOO/09359 



54 



literature ["Molecular Cloning. A Laboratory Manual", Cold 
Spring Harbor Laboratory, 1989] . Unless otherwise stated, 
restriction enzymes and various modifying enzymes to be used 
were those available from Takara Shuzo. The buffer 
5 compositions and the reaction conditions for each of the 
enzyme reactions were ' as described in the attached 
instructions* The cDNA synthesis was carried out according 
to the literature [Kato, S- et - al.. Gene 150: 243-250 
(1994)]. 

10 (1) Selection of cDNAs Encoding Proteins Having 

Hydrophobic Domains 

The cDNA library of epidermoid carcinoma cell line 
KB {W098/11217) , and the cDNA libraries constructed from 
human kidney mRNA (Clontech) and human umbilical cord blood 

15 mRNA were used as cDNA libraries. 

Full-length cDNA clones were selected from the 
respective libraries and the whole base sequences thereof 
were determined to construct a homo-protein cDNA bank 
consisting of the full-length cDNA clones • The 

20 hydrophobicity/hydrophilicity profiles were determined for 
the proteins encoded by the full-length cDNA clones 
registered in the homo-protein cDNA bank by the Kyte- 
Doolittle method [Kyte, J- & Doolittle, R. F., J, Mol. Biol, 
157: 105-132 (1982)] to examine the presence or absence of a 

25 hydrophobic domain. A clone that has a hydrophobic region 
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being assumed as a secretory signal or a transmembrane 
domain in the amino acid sequence of the encoded protein was 
selected as a clone candidate. 

(2) Protein Synthesis by In Vitro Translation 

5 The plasmid vector bearing the cDNA of the present 

invention was used for in vitro transcription/ translation 
with a TnT rabbit reticulocyte lysate kit (Promega) . In this 
case, ['^S] methionine was added to label the expression 
product with a radioisotope. Each of the reactions was 

10 carried out according to the protocols attached to the kit. 
Two micrograms of the plasmid was subjected to the reaction 
at 30°C for 90 minutes in the reaction solution of a total 
volume of 25 ill containing 12.5 ul ]i of T^T rabbit 
reticulocyte lysate, 0.5 pi of a buffer solution (attached 

15 to the kit) , 2 ul of an amino acid mixture (without 
methionine), 2 ]xl of [^^S] methionine (Amersham) (0.37 MBq/jil), 
0.5 pi of T7 RNA polymerase, and 20 U of RNasin, The 
experiment in the presence of a membrane system was carried 
out by adding 2.5 pi of canine pancreas microsome fraction 

20 (Promega) to the reaction system. 2 pi of the SDS sampling 
buffer (125 kM Tris-hydrochloride buffer, pH 6.8, 120 mM 2- 
mercaptoethanol, 2% SDS solution, 0.025% Bromophenol Blue 
and 20% glycerol) was added to 3 pi of the reaction solution. 
The resulting mixture was heated at 95*^0 for 3 minutes and 

25 then subjected to SDS-polyacrylamide gel electrophoresis. 
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The molecular weight of the translation product was 
determined by carrying out the autoradiography. 
(3) Expression in C0S7 

Escherichia coli cells harboring the expression 
5 vector for the protein of the present invention were 
cultured at 37 °C for 2 hours in 2 ml of the 2 x YT culture 
medium containing 100 |ig/ml of ampicillin, the helper phage 
M13K07 (50 M 1) was added thereto, and the cells were then 
cultured at 37 ''C overnight. Single-stranded phage particles 

10 were obtained by polyethylene glycol precipitation from a 
supernatant separated by centrifugation. The particles were 
suspended in 100 jil of 1 irM Tris-0.1 mM EDTA, pH 8 (TE) . 

The cultured cells derived from monkey kidney, 
C0S7, were cultured at 37 ""C in the presence of 5% CO^ in the 

15 Dulbecco's modified Eagle's medium (DMEM) containing 10% 
fetal calf serum. 1 x 10^ C0S7 cells were inoculated into a 
6-well plate (Miinc, well diameter: 3 cm) and cultured at 
37 °C for 22 hours in the presence of 5% CO2. After the medium 
was removed, the cell surface was washed with a phosphate 

20 buffer solution followed by IMEM containing 50 mM Tris-- 
hydrochloride (pH 7.5) (TIMBM) . A suspension containing 1 pi 
of the single-stranded phage suspension, 0.6 ml of the DMEM 
medium and 3 pi of TRANSFECTAM"^ (IBF) was added to the cells 
and the cells were cultured at 37 °C for 3 hours in the 

25 presence of 5% CO2. After the sample solution was removed. 
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the cell surface wais washed with TDMEM, 2 ml per well of 
DMEM containing 10% fetal calf serum was added, and the 
cells were cultured at 37 °C for 2 days in the presence of 5% 
COj. After the medium was exchanged for a medium containing 
5 [^^S] cysteine or [^^S] methionine, the cells were cultured for 
one hour. After the medium and the cells were separated each 
other by centrifugation, proteins in the medium fraction and 
the cell membrane fraction were subjected to SDS-PAGE. 
(4) Preparation of Antibodies 
10 A plasmid vector containing the cDNA of the 

present invention was dissolved in a phosphate buffer 
solution (PBS: 145 itiM NaCl, 2.68 mM KCl, 8,09 mM Na2HP04, 2 
itiM KH2PO4, pH 7.2) at a concentration of 2 \xg/]il. 25 yl each 
(a total of 50 of the thus prepared plasmid solution in 

15 PBS was injected into the right and left musculi quadriceps 
femoris of three mice (ICR line) using a 26 guage needle. 
After similar injections were repeated for one month at 
intervals of one week, blood was collected. The collected 
blood was stored at 4-0 overnight to coagulate the blood, 
20 and then centrifuged at 8,000 x g for five minutes to obtain 
a supernatant. NaNa was added to the supernatant to a 
concentration of 0.01% and the mixture was then stored at 
4''C. The generation of an antibody was confirmed by 
imraunostaining of COSl cells into which the corresponding 
25 vector had been introduced, or by Western blotting using a 
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cell lysate or a secreted product. 
(5) Clone Examples 

<HP03613> (SEQ ID NOS: 1, 11/ and 21) 
Determination of the whole base sequence of the 
cDNA insert of clone HP03613 obtained from cDNA library of 
human kidney revealed the structure consisting of a 337'-bp 
5' -untranslated region, a 1737-bp ORF, and a 791-bp 3'- 
untranslated region* The ORF encodes a protein consisting of 
578 amino acid residues and there existed eleven putative 
transmembrane domains. Figure 1 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
Doolittle method, of the present protein. In vitro 
translation resulted in formation of a translation product 
of high molecular weight. 

The search of the protein database using the amino 
acid sequence of the present protein revealed that the 
protein was similar to mouse organic cation transporter- 
like protein (Accession No. BAA23875) . Table 2 shows the 
comparison between amino acid sequences of the human protein 
of the present invention (HP) and mouse organic cation 
transporter-like protein (MT) . Therein, the marks of *, 
and . represent a gap, an amino acid residue identical with 
that of the protein of the present invention, and an amino 
acid residue similar to that of the protein of the present 
invention, respectively. The both proteins shared a homology 
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of 70.4% in the entire region. 
Table 2 

5 HP MFSELU)LVGGLGRFQVLQTMAlMSimCTQSMLENFSAAVPSHRCWAPLLDNSTAQ 

*5M:* ******=W:. . **. **. ..*.*. **. ********** ****. *******. * 

MT MAFPELU)RV«3LGiarQLFQTVALVTPILWVnQNMLENFSAAVPHHRCWVPLLDNSTSQ 

HP ASILGSLSPEALLAISIPPGPNQRPHQCRRFRQPQWQLLDPMTATSWSEADTEPCVDGW 
10 *** *. *. *, . ***. ***+♦*. *. **** ********* . . *****, **, *. **** *** 

MT ASIPGDLGPDVLLAVSIPPGPDQQPHQCLRFRQPQWQLTESNATATNWSDAATEPCBDGW 
HP YYDRSIFTSTIVAKWNLVCDSHALKPMAQSIYLAGILVGAAACGPASDRFGRRLVLTWSY 

*♦*. *. * ****. , *, ***. *. **. ******. *********. **. ***4(**:|c|: ****** 

15 MT VYDHSTFRSTIVTTWDLVCNSQALRPMAQSIFLAGILVGAAVCGHASDRFGRRRVLTWSY 
HP LQMAVMGTAAAFAPAFPVYCLFRFLLAFAVAGVMMNTGTLRRSLTWRHAGGLHAGSRAEP 
MT LLVSVSGTAAAFMPTFPLYCLFRFLLASAVAGVMMNTAS ~ 

20 

HP LGLLAVMEWTAARARPLVMTLNSLGFSPGHGLTAAVAYGVRDWTLLQLWSVPFFLCFLY 

. ****. * 4c:jc*****, ******, **. . ******, * . **+^ iip^^ i^Hn *, * 

MT LLMEWTSAQGSPLVMTLNALGFSFGQVLTGSVAYGYRSWRMLQUVSAPFFLFFVY 



25 HP SWWLAESARWLLTTGRLDWGLQELWRVAAINGKGAVQDTLTPEVLLSAMREELSMGQPPA 
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*. *. ** ***** ****. ♦ . * **** *** *** ***.... * 

MT SWWLPESARWLITVGKLDQGLQELQRVAAVNRRKABGDTLTMEVLRSAMEEEPSRDKAGA 

HP SLGTLLRMPGLRFRTCISTLCWFAFGFTFFGLALDLQALGSNIFLLQMFIGWDIPAKMG 
5 **** ** ** **********.***************** .**.*+.*.* ♦ 

MT SLGTLLHTPGLRHRTIISMLCWFAFGFTFYGLALDLQALGSNIFLLQALIGIVDFPVKTG 

HP ALLLI^HLGRRPTLAASLLLAGLCILANTLVPHEMGALRSALAVLGLGGVGAAFTCITIY 

.. *.*,*****.*.****.**. ******'M!*** .*.*♦*****. 

10 MT SLLLISRLGRRLCQVSFLVLPGLCILSNILVPHGMGVLRSALAVLGLGCLGGAFTCITIF 
HP SSELFPTVLRMTAVGLGQMAARGGAILGPLVRLLGVHGPWLPLLVYGTVPVLSGLAALLL 

******* *. ******. **********. *. *. ******. ************ 

MT SSELFPTVIRMTAVGLCQVAARGGAMLGPLVRLLGVYGSWMPLLVYGWPVLSGLAALLL 

15 

HP PETQSLPLPDTIQDVQNQAVKKATHGTLGNSVLKSTQF 
***_*********.*,*.***.**.* ,.*.*.**.. 
MT PETKNLPLPDTIQDIQKQSVKKVTHDTPDGSILMSTRL 

20 The search of the GenBank using the base sequences 

of the present cDNA has revealed the registration of 
sequences that shared a homology of 90% or more (for exan^ile. 
Accession No. AI792236) . However, since they are partial 
sequences, it can not be judged whether or not they encode 

25 the same protein as the protein of the present invention. 
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<HP03700> (SEQ ID NOS: 2, 12^ and 22) 
Determination of the whole base sequence of the 
cDNA insert of clone HP03700 obtained from cDNA library of 
human kidney revealed the structure consisting of a 45-bp 
5 5' -untranslated region, a 732-bp ORF, and a 2546-bp 3^- 
untranslated region. The ORF encodes a protein consisting of 
243 amino acid residues and there existed three putative 
transmembrane domains. Figure 2 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 

10 Doolittle method, of the present protein. In vitro 
translation resulted in formation of a translation product 
of 27 kDa that was somewhat larger than the molecular weight 
of 25,561 predicted from the ORF. 

The search of the' protein database using the amino 

15 acid sequence of the present protein revealed that the 
protein was similar to mouse yolk sac permease-like 
molecule 1 (Accession No. A?yv92292) . Table 3 shows the 
conqparison between amino acid sequences of the human protein 
of the present invention (HP) and mouse yolk sac permease- 

20 like molecule 1 (MY) . Therein, the marks of *, and . 
represent a gap, an amino acid residue identical with that 
of the protein of the present invention, and an amino acid 
residue siaiilar to that of the protein of the present 
invention, respectively. The both proteins shared a homology 

25 of 74.5% in the N-terminal region of 231 amino acid residues. 
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Table 3 

HP MSRSPLNPSQLRSVGSQDALAPLPP— PAPQNPSTHSWDP-LCGSLPWGLSCLLALQHVL 

5 * . * * * i|e*. ***** *, ****. . **. . . *. , *********** * 

MY MSRSPLHPIPLLSEGYQDTPAPLPPLLPPLQNPSSRSWASRVFGPSTWGLSCLLALQHFL 
HP VmSLLCVSm.LLLCSI^PGGLSYSPSQLUSSFFSCGMSTILQ™iGSRLPLVQAPSLE 

*4c** . ****** . *. ******. *. ***********. *♦. ***********, ****** 

10 MY VLASLLWASHLLLLHGLPPGGLSYPPAQLLASSFFSCGLSTVLQTffMGSRLPLIQAPSLE 
HP FLIPALVLTSQKLPRAIQTPGNSSLMLHLCR-GPSCHGLGHW^fTSLQEVSGAYWSGLLQ 

***4c****4:, **** . . . ****. ***.**. . , *****. *****. ************* 

MY FLIPALVLTNQKLPLTTKTPGNASLSLPLCSLTRSCHGLELWNTSLREVSGAVWSGLLQ 

15 

HP GMMGLLGSPGHVFPHCGPLVLAPSLVVAGLSAHREVAQFCFTHffGLALLYVSPERRGMVP 
* _ **** **, ***. ******************. ****** . ******* _ 

MY GTIGLLGVPGRVFPYCGPLVLAPSLWAGLSAHKEVAQFCSAHWGLALLLILLMWCSQH 

20 HP SGGVWGD 

MY LGSCQIPLCSWRPSSTSTHICIPVFRLLSVLAPVACVWFISAFVGTSVIPLQLSEPSDAP 

The search of the GenBank using the base sequences 
25 of the present cDNA has revealed the registration of 
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sequences that shared a homology of 90% or more (for example. 
Accession No. AW167520) • However, since they are partial 
sequences, it can not be judged whether or not they encode 
the same protein as the protein of the present invention. 
5 <HP03935> (SEQ ID NOS: 3, 13, and 23) 

Determination of the whole base sequence of the 
cDNA insert of clone HP03935 obtained from cDNA library of 
human kidney revealed the structure consisting of a 72-bp 
S'-Tontranslated region, a 1386-bp ORF, and a 127-bp 3'- 

10 tmtranslated region. The ORF encodes a protein consisting of 
461 amino acid residues and there existed a putative 
secretory signal at the N-terminus. Figure 3 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
Doolittle method, of the present protein. In vitro 

15 translation resulted in formation of a translation product 
of 56 kDa that was somewhat larger than the molecular weight 
of 52,052 predicted from the ORF. In this case, the addition 
of a microsome led to the formation of a product of 61 kDa. 
In addition, there exists in the amino acid sequence of this 

20 protein two sites at which N-glycosylation may occur (Asn- 
Ser^Ser at position 193 and Asn-Ser-Thr at position 236) . 
Application of the (-3,-1) rule, a method for predicting the 
cleavage site of the secretory signal sequence, allows to 
expect that the mature protein starts from histidine at 

25 position 32. 
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The search of the protein database using the amino 
acid sequence of the present protein revealed that the 
protein was similar to Arabidopsis thaliana hypothetical 
protein {Accession No. CAB41318) . Table 4 shows the 
5 comparison between amino acid sequences of the human protein 
of the present invention (HP) and Arabidopsis thaliana 
hypothetical protein (AT) . Therein, the marks of *, and * 
represent a gap, an amino acid residue identical with that 
of the protein of the present invention, and an amino acid 
10 residue similar to that of the protein of the present 
invention, respectively. The both proteins shared a homology 
of 30.8% in the intermediate region of 214 amino acid 
residues . 

15 Table 4 

HP MPQSLPSSRMAPLGMLLGLLMMCFTFCLSHQ^LKEFALT^^ 

HP ELDAEVLEVFHFTHEWQALQPGQAVPAGSHVRLNLQTGEREMLQ^^ 

20 

AT MPTIFFFRYVFLLVVISLVGFSIAEKVNSSGGMVWSSVRDEAELVEDSGVVIGEQDQ 



25 



HP INTNTYTSQDLKSAIJU(FKEGAEliffiSSKEDmQAEVKRLFRPIEELKKDF 

AT IDGGFSSLDGMLHWAIGHSDPATLKEAAKDAEKMS-LDELQKRQLELKELVEKLK— MPS 
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HP DMQIMVRLINKFNSSSSSLEEKIAALFDLEYYVHtaiDNAQDLLSFGGLQVVINGLNSTEP 
. . . * *...*.** ***. . ** . * *. . . ***. ** . ***. ** . . ** ... 
AT NAKLMQIAIDDLNNSSLSLEDRHRALQELLILVEPIDNANDLSKSGGLRWAGELNHDDT 

5 

HP LVKEYAAFVLGAAFSSNPKVQVEAIEGGALQKLLVILATEQPLTAKKKVLFALCSLLRHF 

*. , *** * ,****...* *** . ♦ *. ***. ..*.*. 

AT EVRKUUVmGKASQNNPFVQEQVLELGALTT-LIKMVNSSSTEEAVKALFAVSALIRNN 

10 HP PYAQRQFLKLGGLQVLRTLVQEKGTEV-LAVRWTLLYDLVTEKMFAEEBAELTQEMSPE - 

.* *, * .+* * ..**.** *. .** 

AT lAGQDLFFAAHGYIMLRDVMMGSLDMLRRMVFLVGDIjyESQLQNTEKDELPIFKDRL 

HP KLQQYRQVHLLPGLWEQGWCEITAHLLALPEHDAREKVLQTLGVLLTTCRDRYRQDPQLG 

15 

AT FLKSVVDLIWLDLDLQEKALTAIQTLLQLKSIEPQVLRESCGLBEALERMKLQLEESMA 

HP RTUSLQAEYQVLASLELQDGEDEGYFQELLGSVNSLLKELR 

20 AT DEYKRDYAADVESIRGEVELIFRQKLGLL 

The search of the GenBank using the base secjuences 
of the present cDNA has revealed the registration of 
sequences that shared a homology of 90% or more (for example, 
25 Accession No. AW025017) among ESTs. However, since they are 
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partial sequences, it can not be judged, whether or not they 
encode the same protein as the protein of the present 
invention* 

<HP10755> (SEQ ID NOS: 4, 14, and 24) 
5 Deteriaination of the whole base sequence of the 

cDNA insert of clone HP10755 obtained from cDNA library of 
human kidney revealed the structure consisting of a 55-bp 
5' -untranslated region, a 1944-bp ORF, and a 123-bp 3'- 
untranslated region. The ORF encodes a protein consisting of 

10 647 amino acid residues and there existed eight putative 
transmembrane' domains. Figure 4 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte-- 
Doolittle method, of the present protein • In vitro 
translation resulted in formation of a translation product 

15 of high molecular weight. 

The search of the protein database using the amino 
acid sequence of the present protein revealed that the 
protein was similar to human hypothetical protein KIAA0062 
{Accession No. BAA06685). Table 5 shows the comparison 

20 between amino acid sequences of the human protein of the 
present invention (HP) and human hypothetical protein 
KIAA0062 (KI). Therein, the marks of *, and . represent a 
gap, an amino acid residue identical with that of the 
protein of the present invention, and an amino acid residue 

25 siioilar to that of the protein of the present invention, 
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respectively. The both proteins shared a homology of 30. 
in the C-terminal region of 408 amino acid residues. 

Table 5 



HP MASLVSLELGLLLAVLVVTATASPPAGLLSLLTSGQGALDQEALGGLLNTLADRVHCTNG 

HP PCGKCLSVEDALGLGEPEGSGLPPGPVLEARYVARLSAAAVLYLSNPEGTCEDTRAGLWA 

HP SHADHLLALLESPKALTPGLSWLLQRMQARAAGQTPKTAGVDIPQLLEEAVGAGAPGSAG 

KI RVYADAPAKLLLPPPAAWDLAVRLRGAEAASERQVYSVTM 

HP GVLAALLDHVRSGSCFHALPSPQYFVDFVFQQHSSEVPMTLAELSALMQRLGVGREAHSD 

KI KLLLLHPAFQSCLLLTLLGLWRTTPEAHASSLGAPAISAASFLQDLIHRYGEGDSLTLQQ 

HP HSHRHRGASSRDPVPLISSSNSSSVWDTVCLSARDVMAAYGLSEQAGVTPEAWAQLSPAL 

..*,*. *...*... ♦** *. . 

KI LKALimDVGVGRGNVTQHVQGHRNLSTCFSSGDLFTAHNFSEQSRIGSSBLQBFCPTI 

HP LQQQLSGACTSQSRPPVQDQLSQSER YLYGSLATLLICLCAVFGLLLLTCTGGR 

4: **** * .***..*.**...*.,. 

KI LQQLDSRACTSENQENEENEQTEEGRPSAVEVWGYGLLCVTVISLCSLLGASWPFMK-K 
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HP GVAHYILQTFLSUVVGALTGDAVLHLTPKVLGLHTHSEEGLSPQPTWRLLAMUGLYAFF 

. . .* *..**.*.*..*...*.*...*... * *. * ** 

KI TFYKRLLLYFIALAIGTLYSNALFQLIPEAFGFNPL-EDYYVSKSA WFGGFYLFF 

5 HP LFENLFNLLL-PRDPEDLEDGPCGHSS-HSHGGHSHGVSLQLAPSELRQPKPPH EG 

. ♦ ** * .* ** .*...*. *.* 

KI FTEKILKILLKQKNEHHHGHSHYASESLPSKKDQEEGVMEKLQNGDLDHMIPQHCSSELD 

HP SRADLVAE ESPELLNPE PRRLS-PELRLLPYMITLGDAVHNFADGLAV 

10 ..*.*.* ...*... 

KI GKAPMVDEKVIVGSLSVQDLQASQSACYWLKGVRYSDIGTLAWMITLSDGLHNFIDGLAI 

HP GAAFASSWKTGLATSLAVFCHELPHELGDFAALLHAGLSVRQALLLNLASALTAFAGLYV 

15 KI GASFTVSVFQGISTSVAILCEEFPHELGDFVILLNAGMSIQQALFFNFLSAGCCYLGLAF 

HP ALAVGVSEESEAWILAVATGLFLYVALCDMLPAMLKV RDPRPWLLFLLHNVGLLG 

_ . * *. *..**.*.*. *. *. . ***. 

KI GILAG-SHFSANWIFALAGGMFLYISLADMFPEMNEVCQEDERKGSILIPFI IQNLGLLT 

20 

HP GWTVLLLLSLYEDDITF 
KI GFTIMWLTMYSGQIQIG 



25 



Furthermore, the search of the GenBank using the 
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base sequences of the present cDNA has revealed the 
registration of sequences that shared a homology of 90% or 
more (for example. Accession No. AA42490) among ESTs. 
However, since they are partial sequences, it can not be 

5 judged whether or not they encode the same protein as the 
protein of the present invention. 

<HP10760> (SEQ ID NOS : 5, 15, and 25) 
Determination of the whole base sequence of the 
cDNA insert of clone HP10760 obtained from cDNA library of 

10 human kidney revealed the structure consisting of a 61-bp 
5' -untranslated region, a 1341-bp ORF, and a 373-bp 3'- 
untranslated region. The ORF encodes a protein consisting of 
446 amino acid residues and there existed a putative 
secretory signal at the N-terminus . Figure 5 depicts the 

15 hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
Doolittle method, of the present protein. In vitro 
translation resulted in formation of a translation product 
of 48 kDa that was somewhat smaller than the molecular 
weight of 49,468 predicted from the ORF. In this case, the 

20 addition of a microsome led to the formation of a product of 
50 kDa. In addition, there exists in the amino acid sequence 
of this protein two sites at which N-glycosylation may occur 
(Asn-Ala-Thr at position 144 and Asn-Ile-Ser at position 
243). Application of the (-3,-1) rule, a method for 

25 predicting the cleavage site of the secretory signal 
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sequence, allows to expect that the laature protein starts 
from glutamic acid at position 27, 

The search of the protein database using the amino 
acid sequence of the present protein revealed that the 
5 protein was similar to human 25 kDa trypsin inhibitor 
(Accession No. BAA25066) . Table 6 shows the comparison 
between amino acid sequences of the human protein of the 
present invention (HP) and human 25 kDa trypsin inhibitor 
(TI) . Therein/ the marks of *, and . represent a gap, an 
10 amino acid residue identical with that of the protein of the 
present invention, and an amino acid residue similar to that 
of the protein of the present invention, respectively. The 
both proteins shared a homology of 33.5% in the intermediate 
region of 185 amino acid residues. 

15 

Table 6 

HP MLHPETSPGRGHLUVVLLALLGTAWAEVWPPQLQE(3APMAG 

20 TI MIAISAVSSALLFSLLCEASTVVLLNSTDSSPPTNNIT'DIEMLKAQLDSADIPI^^ 

HP ALNRKESFLLLSLHMLRSWVQPPAADMIffiLDWSDSUQUQARAAL^^ 

*. * * . . * * 

TI YISQbffiMIAlLDTOQVRGKVFPPAANMEYMVWDENU^ 

25 
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HP RTLQVGWNMQLLPAGLASFVEWSLVFAEGQRYSHA-AGBC ^AR— NATCTHnQL 

* ♦ * *.*..*.*....* * . . ♦**♦**. 

TI RFLGQN— LSVRTGRYRSILQLVKPWYDEVKDYAFPYPQDCNPRCPMRCFGPMCTHYTQM 

5 HP VWATSSQLGCGRHLCSAGQA~AI EAF-VCAYSPGGNWEVNGKTIIPYKKGAWCSLC 

, . **. * * H=*. *. * *. . *** ** * 

TI WATSNRIGCAIHTCQNMNWGSWRRAVYLVCOTAPKGM— IGEA—^ 

HP TASVSGCFKAWDHAGGLCEVPRNPCRMSCQNHGRLMISTCHCHCPPGYTGR^^^ 

10 .* 

TI PPSYGGSCTDNLCFPGVTSNYLYWFK 

The search of the GenBank using the base sequences 
of the present cDNA has revealed the registration of 
15 sequences that shared a homology of 90% or more (for example. 
Accession No. AI792411) among ESTs. However, since they are 
partial sequences, it can not be judged whether or not they 
encode the same protein as the protein of the present 
invention. 

20 <HP10764> (SEQ ID NOS: 6, 16, and 26) 

Determination of the whole base sequence of the 
cDNA insert of clone HP10764 obtained from cDNA library of 
human kidney revealed the structure consisting of a 326-bp 
5' --untranslated region, a 594-bp ORF, and a 452-bp 3'- 

25 untranslated region. The ORF encodes a protein consisting of 
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197 amino acid residues and there existed two putative 
transmembrane domains. Figure 6 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
Doolittle method, of the present protein. In vitro 
5 translation resulted in formation of a translation product 
of 25 kDa that was somewhat larger than the molecular weight 
of 21,508 predicted from the ORF. 

The search of the GenBank using the base sequences 
of the present cDNA has revealed the registration of 
10 sequences that shared a homology of 90% or more (for example. 
Accession No. H45965) among ESTs. However, since they are 
partial sequences, it can not be judged whether or not they 
encode the same protein as the protein of the present 
invention - 

15 <HP10768> (SEQ ID NOS: 7, 17, and 27) 

Determination of the whole base sequence of the 
cDNA insert of clone HP10768 obtained from cDNA library of 
human kidney revealed the structure consisting of a 100-bp 
5' -untranslated region, a 1623-bp ORF, and a 351-bp 3'- 

20 untranslated region. The ORF encodes a protein consisting of 
540 amino acid residues and there existed nine putative 
transmembrane domains. Figure 7 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
Doolittle method, of the present protein. In vitro 

25 translation resulted in formation of a translation product 
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of high molecular weight. 

The search of the GenBank using the base sequences 
of the present cDNA. has revealed the registration of 
sequences that shared a homology of 90% or more (for example, 
5 Accession No. AA459236) among ESTs. However, since they are 
partial sequences, it can not be judged whether or not they 
encode the same protein as the protein of the present 
invention. 

<HP10769> (SEQ ID NOS: 8, 18, and 28) 

10 Determination of the whole base sequence of the 

cDNA insert of clone HPl 07 69 obtained from cDNA library of 
human kidney revealed the structure consisting of a 11-bp 
5' -untranslated region, a 1329-bp ORF, and a 912-bp 3'- 
untranslated region. The ORF encodes a protein consisting of 

15 442 amino acid residues and there existed two putative 
transmembrane domains. Figure 8 depicts the 
hydrophobicity/hydrophi licit y profile, obtained by the Kyte- 
Doolittle method, of the present protein. In vitro 
translation resulted in formation of a translation product 

20 of 52 kDa that was somewhat larger than the molecular weight 
of 49,101 predicted from the ORF. 

The search of the GenBank using the base sequences 
of the present cDNA has revealed the registration of 
sequences that shared a homology of 90% or more (for example, 

25 Accession No. AI 625881) among ESTs. However, since they are 
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partial sequences^ it can not be judged whether or not they 
encode the same protein as the protein of the present 
invention . 

<HP10784> (SEQ ID NOS: .9, 19 r and 29) 
Determination of the whole base sequence of the 
cDNA insert of clone HP10784 obtained from cDNA library of 
human kidney revealed the structure consisting of a 60-bp 
5' -untranslated region, a 789-bp ORF, and a 612-bp 3'- 
untranslated region. The ORF encodes a protein consisting of 
262 amino acid residues and there existed six putative 
transmembrane domains. Figure 9 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
Doolittle method, of the present protein. In vitro 
translation resulted in formation of a translation product 
of 28 kDa that was almost identical with the molecular 
weight of 27,551 predicted from the ORF. 

The search of the protein database using the amino 
acid sequence of the present protein revealed that the 
protein was similar to rice (Oryza sativa) hypothetical 
protein (Accession No. AM)39600) . Table 7 shows the 
comparison between amino acid sequences of the human protein 
of the present invention (HP) and rice hypothetical protein 
(OS) . Therein, the marks of and . represent a gap, an 

amino acid residue identical with that of the protein of the 
present invention, and an amino acid residue similar to that 
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of the protein of the present invention, respectively. The 
both proteins shared a homology of 40*0% in the intemediate 
region of 195 amino acid residues. 

5 Table 7 

HP MTPEDPEETQPLLGPPGGSAPRGR 
OS MSFRGBESGGEDGGRTASASDLRKPFLHTGSWYKMSSAGGGGGMGSia.GSSAYSLR^ 

10 

HP RVFLAAFAAALGPI^FGFALGYSSPAIPSLQRAAPPAPRLDDAAASWFGAWTLGAAAGG 

OS SAVLCTLIVALGPIQFGFTCGFSSPTQDAI ISDLGLTLSEFSLFGSLSNVGAMVGA 

15 HP VLGGWLVDRAGRKLSLLLCSVPFVAGFAVITAAQDVWMLLGGRLLTGLACGVASLVAPVY 
. . * . . . ♦** **. . . . * . *. . *. *. * . *. ***♦. *. . ** * *. *** 
OS lASGQIAEYIGRKGSLMIAAIPNIIGWLAISFAKDSSFLFMGRLLEGFGVGVISYWPVY 

HP ISEIAYPAVRGLLGSCVQIMWGILUYUGWVLEWRWLAVLGCVPPSLMLLLMCFMPE 
20 *. *** ...** *** ** * . ** *. *** .* , *. ** 

OS lAEIAPQTMRGALGSVNQLSVTIGILLAYLLGMFVPffRILSVLGILPCSILIPGLFFIPE 

HP TPRFLLTQHRRQEAAPGLVRCGHGVQHECLRRLLQADPGWPWQLURGHLGACLCTAC 
.**.*. * 

25 OS SPRWLAKMGKMEDFESSLQVLRGFETDIAVBVNEIKRSVQSSRRRTTIRFADIKQKRYSV 
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The search of the GenBank using the base sequences 
of the present cDNA has revealed the registration of 
sequences that shared a homology of 90% or more (for example, 
Accession No. AW028826) among ESTs, However, since they are 
partial sequences, it can not be judged whether or not they 
encode the same protein as the protein of the present 
invention . 

<HP10786> {SEQ ID NOS: 10, 20, and 30) 
Determination of the whole base seqaeuce of the 
cDNA insert of clone HP10786 obtained . from cDNA library of 
himan kidney revealed the structure consisting of a 78-bp 
5' --untranslated region, a 459-bp ORE, and a 585-bp 3'- 
untranslated region. The ORE encodes a protein consisting of 
152 amino acid residues and there existed a putative 
secretory signal at the N-terminus. Figure 10 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
Doolittle method, of the present protein. In vitro 
translation resulted in formation of a translation product 
of 17 kDa that was almost identical with the molecular 
weight of 16,904 predicted from the ORE. 

Furthermore, the search of the GenBank using the 
base sequences of the present cDNA has revealed the 
registration of sequences that shared a homology of 90% or 
more (for example. Accession No. AW052022) among ESTs. 
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However, since they are partial sequences/ it can not be 
judged whether or not they encode the same protein as the 
protein of the present invention* 

<HP03727> (SEQ ID NOS: 31, 41, and 51) 
5 Determination of the whole base sequence of the 

cDNA insert of clone HP03727 obtained from cDNA library of 
htuaan kidney revealed the structure consisting of a 254-bp 
5' -untranslated region, a 1008-bp ORF, and a 355-bp 3'- 
untranslated region. The ORF encodes a protein consisting of 

10 335 amino acid residues and there existed one putative 
transmembrane domain. Figure 11 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
Doolittle method, of the present protein. In vitro 
translation resulted in formation of a translation product 

15 of 41 kDa that was somewhat larger than the molecular weight 
of 37,999 predicted from the ORF. 

The search of the protein database using the aiaino 
acid sequence of the present protein revealed that the 
protein, was similar to protein MG87 from diabetic rat 

20 kidney (Accession No. AAC64190) . Table 8 shows the 
comparison between amino acid sequences of the human protein 
of the present invention (HP) and protein MG87 from diabetic 
rat kidney (RD) . Therein, the marks of and . represent 

a gap, an amino acid residue identical with that of the 

25 protein of the present invention, and an amino acid residue 
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similar to that of the protein of the present . invention, 
respectively. The both proteins shared a homology of 74.2% 
in the entire region. 

5 Table 8 

HP MGASSSSAUM.GLPARPWPRWU)VAAyjy\AVALGTVAWRMWPRRIffiRLQQVGTV^ 

RD MGSSSSTALARLGLPGQPRSTWLGVAALGLAAVALGTVAWRRARPRRRRQLQQVGTVSKV 

10 

HP WIYPVKSCKGVPVSEAECTAMGLRSGNLRDRFWLVIKEDGHMVTARQEPRLVLISIIYEN 

RD WIYPIKSCKGVSVCETECTDMGLRCGKVRDRFWMWKEDGHMITARQEPRLVLVTITLEN 

15 HP NCLIFRAPDMDQLVLPSKQPSSNKLHNCRIFGLDIKGRDCGNEAAKWFTNFLKTEAYRLV 

RD NYyaLEAPGMEPIVLPIKLPSSNKIHDCRLFGLDIKGRDCGDEVARWFTSYLKTQAYRLV 

HP QFET^MGRTSRKLLPTLD— QNFQVAYPDYCPLLIMTDASLVDL^^rRMEKKMI^^ 

20 **.*.**♦**,.***, . **..*****..* *********..**,****** 

RD QFDTmGRTTKKLYFSESYLQNyEVAYPDCSPIHLISEASLVDLNTRLQKKVKMEYFRP 

HP NimGCDAFEEDTWDELLIGSVEVKKVMACPRCILTTVDPDTGVIDRKQPLDTLKSYRL 
**. *************. **. *. *. . ****. *********. **♦*. **. ******* 

25 RD NIWSGCEAFBEDTWDELLIGDVEMKRVLSCPRCVLnVDPDTGIIDRKEPLETLKSYRL 
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HP CDPSERELYKLSPLFGIYYSVEKIGSLRVGDPVYRMV 
RD CDPSYKSLYQSSPLFGMYFSVEKIGSLRVGDPVYRMVD 

5 

The search of the GenBank using the base sequences 
of the present cDNZV has revealed the registration of 
sequences that shared a homology of 90% or more (for example. 
Accession No. AI912794) among ESTs. However, since they are 
10 partial sequences, it can not be judged whether or not they 
encode the same protein as the protein of the present 
invention. 

<HP03801> (SEQ ID NOS: 32, 42, and 52) 
Determination . of the whole base sequence of the 

15 cDNA insert of clone HP038 01 obtained from cDNA library of 
human ximbilical cord blood revealed the structure consisting 
of a 158-bp 5' -untranslated region, a 627-bp ORF, and a 964- 
bp 3' -untranslated region. The ORF encodes a protein 
consisting of 208 amino acid residues and there existed- six 

20 putative transmembrane domains. Figure 12 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
Doolittle method, of the present protein. In vitro 
translation resulted in formation of a translation product 
of 23 kDa that was almost identical with the molecular 

25 weight of 22,526 predicted from the ORF. 
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The search of the protein database using the amino 
acid sequence of the present protein revealed that the 
protein was similar to human hypothetical protein CGI-15 
(Accession No. AAD27724) . Table 9 shows the comparison 
5 between amino acid sequences of the human protein of the 
present invention (HP) and human hypothetical protein CGI- 
15 (CP) . Therein, the marks of *, and . represent a gap, 
an amino acid residue identical with that of the protein of 
the present invention, and an amino acid residue similar to 
10 that of the protein of the present invention, respectively. 
The amino acid sequences of the two proteins were completely 
different each other in the N-terminal, intermediate and C- 
terminal regions although partial match was observed. 

15 Table 9 

HP MELRAALVLmLIAGGLFMFTYKSTQFNVEGFALVLGASFIGGIRW 

CP VlPILIFSLIFKLEELRAALVLmLIAGGLFMFTYKSTQFNVEGFAWCWGPRSSV 

20 

HP TLTQMLLQKAELGLQNPIDTMFHLQPLMFLGLFPLFAVFEGLHI^TSEKI 

CP PSPRCSCRRLNSASRIPSTPCSTCSHSCSWGLFPLFAVFEGLHLSTSEKIFRFQDTGLLL 

25 HP RVLGSLFLGGILAFGLGFSEFLLVSRTSSLTLSIAGIFKEVCTLLywiHLLGDQlSLW 
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CP RVLGSLFLWJILAFGLGFSEFLLVSRTSSLTLSIAGIFKEVCTLLljymL^^ 

HP LGFALCLSGISLHVALKALHSRGNPESLPEASVFCSSPCDS 

5 **** 

CP LGFASASREYPSTLPSKPCIPEVMVAPRP 



Furtliermore, the search of the GenBank using the 
base sequences of the present cDNA has revealed the 

10 registration of sequences that shared a homology of 90% or 
more {for example ^ Accession No. AI741613) among ESTs. 
However, since they are partial sequences, it can not be 
judged whether or not they encode the same protein as the 
protein of the present invention* 

15 <HP03883> (SEQ ID NOS ; 33, 43, and 53) 

Determination of the whole base sequence of the 
cDNA insert of clone HP03883 obtained from cDNA library of 
human kidney revealed the structure consisting of a 59-bp 
5' -untranslated region, a 122l'-bp ORF, and a 122-bp 3^^- 

20 untranslated region. The ORF encodes a protein consisting of 
406 amino acid residues and there existed eight putative 
transmembrane domains. Figure 13 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
Doolittle method, of the present protein. In vitro 

25 translation resulted in formation of a translation product 
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of high molecular weight. 

The search of the protein database using the amino 
acid sequence of the present protein revealed that the 
protein was similar to human choline/ethanolamine 
5 phosphotransferase (Accession No. NP_006081) . Table 10 shows 
the comparison between amino acid sequences of the human 
protein of the present invention (HP) and human 
choline/ethanolamine phosphotransferase (CE) • Therein, the 
marks of */ and • represent a gap, an amino acid residue 

10 identical with that of the protein of the present invention, 
and an amino acid residue similar to that of the protein of 
the present invention, respectively. The both proteins 
shared a homology of 66.8% in the entire region. In 
addition, the amino acid sequence from position 70 to 

15 position 311 of the present protein shared a homology of 
98.3% with human MPTl-like protein (Accession No. AAD44019) . 

Table 10 

20 HP MAAGAGAGSAFRWLRALSEFLSAAQLRRLEEHRYSAAG 
CE MSGHRSTRKRCGDSHPESPVGFGHMSTTGCVIMLFQLPTPPLSRHQLKRLEEHRY^^^ 



HP VSLLEPPLQLYWTWLLQWIPLWMAPNSITLLGLAVNmiLVLIS^^^ 
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CE RSLLEPLMQGYWEWLVRRVPSflAPNLITIIGLSINICTTILLVFYCPTATEQAPLWAYI 

HP LCALGLFIYQSLDAIDGKQARRTNSCSPLGELFDHGCDSLSTVFMAVGASIAARLGTYPD 

CE ACACGLFIYQSLDAIDGKQARRTNSSSPLGELFDHGCDSLSTVFWLGTCIAVQLGTNPD 

HP WFFFCSFIGMFVFYCAHWQTYVSGMLRFGKVDVTEIQIALVIVFVLSAFGGATMWDYTIP 

CE WMFFCGFAGTFMFYCAHWQTYVSGTLRFGI IDVTEVQIFIIIMHLLAVIGGPPFWQSMIP 

HP ILEIKLKILPVLGFLGGVIFSCSNYFHVILHGGVGKNGSTIAGTSVLSPGLHIGLIIILA 

CE VLNIQMKIFPALCTVAGTIFSCTNYFRVIFTGGVGKNGSTIAGTSVLSPFLHIGSVITLA 

HP IMIYKKSATDVFEKHPCLYILMFGCVFAKVSQKLVVAHMTKSELYLQDTVFLGPGLLFLD 
. , ********** ***♦*... ***********. . *. **. *. **. ***** 
CE AMIYKKSAVQLFEKHPCLYILTFGFVSAKITNKLWAHMTKSEMHLHDTAFIGPALLFLD 

HP QYFNNFIDEYVVLWMAMVISSFDMVIYFSALCLQISRHLHLNIFKTACHQAPEQVQVLSS 

****. **♦**. ***. *.*.***.. * ..***.. ***. . . *. 
CE QYFNSFIDEYIVLWIALVFSFFDLIRYCVSVCNQIASHLHIHVFRIKVSTAHSNHH 



The search of the GenBank using the base sequences 
of the present cDNA has revealed the registration of 
sequences that shared a homology of 90% or more (for example. 
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Accession No. AI816449) among ESTs- However, since they are 
partial sequences, it can not be judged whether or not they 
encode the same protein as the protein of the present 
invention . 

5 <HP03913> (SEQ ID NOS: 34, 44, and 54) 

Determination of the whole base sequence of the 
cDNA insert of clone HP03913 obtained from cDNA library of 
human kidney revealed the structure consisting of a 344-bp 
5' -untranslated region, a 1857-bp ORF, and a 273-bp 3'- 

10 untranslated region. The ORF encodes a protein consisting of 
618 amino acid residues and there existed thirteen putative 
transmembrane domains. Figure 14 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
Doolittle method, of the present protein. In vitro 

15 translation resulted in formation of a translation product 
of high molecular weight. 

The search of the protein database using the amino 
acid sequence of the present protein revealed that the 
protein was similar to human solute carrier family 5 

20 (Accession No. NP_000444) . Table 11 shows the comparison 
between amino acid sequences of the human protein of the 
present invention (HP) and human solute carrier family 5 
(SC) . Therein, the marks of *, and . represent a gap, an 
amino acid residue identical with that of the protein of the 

25 present invention, and an aitiino acid residue similar to that 
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of the protein of the present invention, respectively. The 
both proteins shared a homology of 48.3% in the entire 
region . 

5 Table 11 

HP MEVKNFAVWDYVVFAAIi^FISSGIGVFFAIKERmTSREFLVWiR^ - 

SC MEAVETGERPTFGAVTOYGVFALMLLVSTGIGLWVGLARGGQRSAEDFFTGGRRLAALPVG 

10 

HP LSLTASFMSAVTVLGTPSEVYRFGASFLVFFIAYLFVILLTSELFLPVFYRSGITSTYEY 

^^^*^-^*pijt ^^^^ i^^^^ ^'j^^ V* » *I**7* ^ , , 'v*^ , 'l^n*, 'IM^, ^^^^^ -p^ -t^*T>'F'r>^ 

SC LSLSASFMSAVQVLGVPSEAYRYGLKFLWMCLGQLLNSVLTALLFMPVFYRLGLTSTYEY 

15 HP LQLRFNKPVRYAATVIYIVQTILYTGVWYAPAULNQVTGFDLWGSVFATGIVCTFYCT 
*. . **. . . ** . *. ***. *, ****. *. ***** ******. *. *.♦... ***. **** . 
SC LEMRFSRAVRLCGTLQYIVATMLYTGIVIYAPALILNQVTGLDIffASLLSTGIICTFYTA 

HP LGGLKAWWTDAFQMVVMIVGFLTVLIQGSTHAGGFHNVLEQSTNGSRLHIFDFDVDPLR 

20 . **. ♦*.**.* .**..**. . * ** **. ** . 

SC VGGMMVVWTDVFQVVVMLSGFWVVLARGVMLVGGPRQVLTLAQNHSRINLMDFNPDPRS 

HP RHTFWTITVGGTFTWLGIYGVNQSTIQRCISCin'EKHAKLALYFNLLGLWIILVCAVFSG 

*_ ***!»:_ _ ****, , _ *****. .**... *. ***. ***** .*.**..*. . *. . * 

25 SC RYTFWTFWGGTLVWLSMYGVNQAQVQRYVACRTEKQAKULLINQVGLFLIVSSAACCG 
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HP LIMYSHFKDCDPWTSGI ISAPDQLMPYFVMEIFATMPGLPGLFVACAFSGTLSTVASSIN 

. . *. . . *SN!* * ****** ** .*. .♦*...**. ****. ***. ******. . . *** 

SC IVMFVFYTDCDPLLLGRISAPDQYMPLLVLDIFEDLPGVPGLFLACAYSGTLSTASTSIN 

5 

HP AUTVTFEDFVKSCFPHLSDKLSTWISKGLCLLFGVMCTSMAVAASVM-GGWQASLSIH 

*.*.****..*..,*.. . *****.♦..* * ..*. .*. . ***.*.*... 
SC AMAAYTVEDLIKPRLRSLAPRKLVIISKGLSLIYGSACLTVAALSSLLGGGVLQGSFTVM 

10 HP GMCGGPMLGLFSLGIVFPFVNWKGALGGLLTGITLSFWVAIGAFIYPAPASKTflfPLPLST 

+ _+*_*****, . * * *.*,**.*..**. ***. ** . ** ** *. 

SC GVISGPLLGAFILGMFLPACNTPGVLAGLGAGLALSLWVALGATLYPPSEQTMRVLPSSA 

HP DQCIKSNVTATG PPVL SSRPGIADTWYSISYLYYSAVGCLGCI 

15 ..*..*.*.* *. . * • ***. .**..*. ******, *. * *. . 

SC ARCVALSVNASGLLDPALLPANDSSRAPSSGMDASRPALADSFYAISYLYYGALGTLTTV 

HP VAGVIISLITGRQRGEDIQPLLIRPVCNLFCFWSKKYKTLCWCGVQHDSGTEQENLENGS 

20 SC LCGALISCLTGPTKRSTLAPGLLWWDLARQTASVAPKEEVAILDDNLVKGPEELPTGNKK 

HP ARKQGAESVLQNGLRRESLVHVPGYDPKDKSYNNMAFETTHF 

SC PPGFLPTNEDRLFFLGQKELEGAGSWTPCVGHDGGRDQQETNL 

25 
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The search of the GenBank using the base sequences 
of the present cDNA has revealed the registration of 
sequences that shared a homology of 90% or more (for example. 
Accession No. AI733508) among ESTs. However, since they are 
partial sequences, it can not be judged whether or not they 
encode the same protein as the protein of the present 
invention, 

<HP10753> (SEQ ID NOS: 35, 45, and 55) 
Determination of the whole base sequence of the 
cDNA insert of clone HP10753 obtained from cDNA library of 
h\3man umbilical cord blood revealed the structure consisting 
of a 141"bp 5' -untranslated region, a 627-bp ORF, and a 
2528-bp 3 ''-untranslated region. The ORF encodes a protein 
consisting of 208 amino acid residues and there existed a 
putative secretory signal at the N-terminus and one putative 
transmembrane domain in the inner portion. Figure 15 depicts 
the hydrophobicity/hydrophilicity profile, obtained by the 
Kyte-Doolittle method, of the present protein. In vitro 
translation resulted in formation of a translation product 
of 28 kDa that was somewhat larger than the molecular weight 
of 21,518 predicted from the ORF. Application . of the (-3,-1) 
rule, a method for predicting the cleavage site of the 
secretory signal sequence, allows to expect that the mature 
protein starts from methionine at position 32. 
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The search of the GenBank using the base sequences 
of the present cDNA has revealed the registration of 
sequences that shared a homology of 90% or more (for example. 
Accession No, AW162064) among ESTs. However, since they are 
5 partial sequences, it can not be judged whether or not they 
encode the same protein as the protein of the present 
invention . 

<HP10758> (SEQ ID NOS: 36, 46, and 56) 
Determination of the whole base sequence of the 

10 cDNA. insert of clone HP10758 obtained from cDNA library of 
human kidney revealed the structure consisting of a 25-bp 
5' -untranslated region, a 1509-bp ORF, .and a 284-bp 3'-, 
untranslated region. The ORF encodes a protein consisting of 
502 amino acid residues and there existed a putative 

15 secretory signal at the N-terminus and one putative 
transmembrane domain in the inner portion. Figure 16 depicts 
the hydrophobicity/hydrophilicity profile, obtained by the 
Kyte-Doolittle method, of the present protein. In vitro 
translation resulted in formation of a translation product 

20 of 60 kDa that was somewhat larger than the molecular weight 
of 55,848 predicted from the ORF. In this case, the addition 
of a microsome led to the fonaation of a product of 66 kDa. 
In addition, there exists in the amino acid sequence of this 
protein six sites at which N-glycosylation may occur (Asn- 

25 Val-Ser at position 67, Asn-Tyr-Thr at position 103, Asn- 
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Phe-Thr at position 156, Asn-Ile-Thr at position 183, Asn^ 
Phe-Thr at position 197 and Asn-Lys-Ser at position 283) . 
implication of the (-3,-1) rule, a method for predicting the 
cleavage site of the secretory signal sequence, allows to 
5 expect that the mature protein starts from alanine at 
position 15. 

The search of the GenBank using the base sequences 
of the present cDNA. has revealed the registration of 
sequences that shared a homology of 90% or more (for example, 
10 Accession No. T96740) among ESTs. However, since they are 
partial sequences, it can not be judged whether or not they 
encode the same protein as the protein of the. present 
invention . 

<HP10771> (SEQ ID NOS: 37, 47, and 57) 
15 Determination of the whole base sequence of the 

cDNA insert of clone HP10771 obtained from cDNA library of 
human kidney revealed the structure consisting of a 36-bp 
5' -untranslated region, a 1011-bp ORF, and a 599-bp 3'- 
untranslated region. The ORF encodes a protein consisting of 
20 336 amino acid residues and there existed one putative 
transmembrane domain at the N-terminus. Figure 17 depicts 
the hydrophobicity/hydrophilicity profile, obtained by the 
Kyte-Doolittle method, of the present protein. In vitro 
translation resulted in formation of a translation product 
25 of 41 kDa that was somewhat larger than the molecular weight 
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of 37,924 predicted from the ORF- 

The search of the protein database using the amino 
acid sequence of the present protein revealed that the 
protein was similar to human interferon-a induced protein 
5 (Accession No. AR053364) . The C-terminal portion downstream 
from methionine at position 51 of the protein of the present 
invention matched with the C-terminal portion downstream 
from methionine at position 12 of hxman interf eron-of induced 
protein. However, the putative transmembrane domain at the 

10 N-terminus observed for the protein of the present invention 
was not present in human interferon-a induced protein. 

The search of the GenBank using the base sequences 
of the present cDNA has revealed the registration of 
sequences that shared a homology of 90% or more (for example, 

15 Accession No. AA452543) among ESTs. However, since they are 
partial sequences, it can not be judged whether or not they 
encode the same protein as the protein of the present 
invention, 

<HP10778> (SEQ ID NOS; 38, 48, and 58) 
20 Determination of the whole base sequence of the 

cDNA insert of clone HP1077B obtained from cDNA library of 
human kidney revealed the structure consisting of a 173-bp 
5' -untranslated region, a 1023-bp ORF, and a 220-bp 3'- 
vintranslated region. The ORF encodes a protein consisting of 
25 340 amino acid residues and there existed six putative 
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transmembrane domains- Figure 18 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
Doolittle method, of the present protein. In vitro 
translation resulted in formation of a translation product 
5 of high molecular weight. 

The search of the GenBank using the base sequences 
of the present cDNA has revealed the registration of 
sequences that shared a homology of 90% or more (for example. 
Accession No. AA429745) among ESTs. However, since they are 
10 partial sequences, it can not be judged whether or not they 
encode the same protein as the protein of the present 
invention. 

<HP10781> (SEQ ID NOS : 39, 49, and 59) 
Determination of the whole base sequence of the 

15 cDm insert of clone HP10781 obtained from cDNA library of 
human kidney revealed the structure consisting of a 88-bp 
5' -untranslated region, a 672-bp ORF, and a 1167-bp 3'- 
untranslated region. The ORF encodes a protein consisting of 
223 amino acid residues and there existed a putative 

20 secretory signal at the N- terminus. Figure 19 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
Doolittle method, of the present protein. In vitro 
translation resulted in fojnaation of a translation product 
of 31 kDa that was larger than the molecular weight of 

25 24,239 predicted from the ORF- In this case, the addition of 
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a microsome led to the formation of a product of 33 kDa* In 
addition, there exists in the amino acid sequence of this 
protein two sites at which N-glycosylation may occur (Asn- 
Asn-Thr at position 70 and Asn-Thr-Ser at position 71) . 
5 Application of the (-3,^1) rule, a method for predicting the 
cleavage site of the secretory signal sequence, allows to 
expect that the mature protein starts from gluthamine at 
position 23. 

The search of the GenBank using the base sequences 
10 of the present cDNA has revealed the registration of 
sequences that shared a homology of 90% or more (for example. 
Accession No. AA334609) among ESTs. However, since they are 
partial sequences, it can not be judged whether or not they 
encode the same protein as the protein of the present 
15 invention. 

<HP10785> (SEQ ID NOS: 40, 50, and 60) 
Determination of the whole base sequence of the 
cDNA insert of clone HP10785 obtained from cDNA library of 
human kidney revealed the structure consisting of a 171-bp 
20 S'-Tintranslated region, a 930-bp ORF, and a 318-bp 3'- 
untranslated region. The ORF encodes a protein consisting of 
309 amino acid residues and there existed six putative 
transmembrane domains. Figure 20 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
25 Doolittle method, of the present protein. In vitro 
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translation resulted in formation of a translation product 
of high molecular weight. 

Furthermore, the search of the GehBank using the 
base sequences of the present cDNA has revealed the 
5 registration of sequences that shared a homology of 90% or 
more (for exanple. Accession No* AI822041) among ESTs. 
However, since they are partial sequences, it can not be 
judged whether or not they encode the same protein as the 
protein of the present invention • 

10 <HP03878> (SEQ ID NOS: 61, 71, and 81) 

Deteirmination of the whole base sequence of the 
cDNA insert of clone HP03878 obtained from cDNA library of 
human kidney revealed the structure consisting of a 77-bp 
5' -untranslated region, a 1800-bp ORF, and a 139-bp 3'- 

15 untranslated region. The ORF encodes a protein consisting of 
599 amino acid residues and there existed ten putative 
trcinsmembrane domains. Figure 21 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
Doolittle method, of the present protein. In vitro 

20 translation resulted in formation of a translation product 
of high molecular weight. 

The search of the protein database using the amino 
acid sequence of the present protein revealed that the 
protein was similar to flounder (Pseudopleuronectes 

25 americanus) Na/Pi cotransport system protein {Accession No. 
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AAB16821) , Table 12 shows the comparison between amino acid 
sequences of the human protein of the present invention (HP) 
and flounder Na/Pi cotiransport system protein (PN) . Therein, 
the marks of -/ */ and . represent a gapr an amino acid 
5 residue identical with that of the protein of the present 
invention, and an amino acid residue similar to that of the 
protein of the present invention, respectively. The both 
proteins shared a homology of 57,1% in the region of 545 
amino acid residues other than the N-terminal and C-terminal 
1 0 regions - 
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Table 12 

HP MPSSLPGSQVPflPTlDAVDLVEKTLRNEGTSSSAPVLEEGDTDPWTLPQLKDTSQPWKEL 

FN MAPRQKVGTNSSPKPALDDDAPVGNIPPAYSTLDLVSDDPDADPffNAPELIDNGVKWSEL 

HP' RVAGRLRRVAGSVLKACGLLGSLYFPICSLDVLSSAFQLLGSKVAGDIFKDNYVLSNPYA 

PN DTKGKMMRVLTGILKLVALLGLLYFFICSLDYLSSAFQLVGGKAAGDIFKDNAVLAMPVA 

HP GLYIGVLVTALVQSSSTSSSIWSMVAAKLLTVRVSYPI IMGYNVGTSITSTLVSMAQSC 
ififun*n. . ifunit*********. . **. *. . ******. ». *, *. *, * *. » 
PN GLVIGVLYTVMVQSSSTSSSIVVSMVSSGLLDVQSAVPIIMGANIGTSYTNTIVAMMQAG 

HP DRDEFQRAFSGSAVHGIFNWLTVLVLLPLESATALLERLSELALGAASLTPRAQAPDILK 

»*. ». . **. . *»**. **. ***** **..*.*..* ***. *. 

PN DKNEFRRAFAGATVHDFFNWLAVLILLPLEYATGVLmTHLI lESFNIQGGEDAPDLLN 

HP VLTKPLTHLIYQLDSDMI-MSSATGNATNSSLIKHWCGTTGQPT — QENSSCGAFGPC 
*. *. m**. . . * ...:...*.* ****. **»...» ....*.».* 
PN YITDPLTDSIYQLDKNVISLIATNDEAAYNMSL IKEffCKTKTNVTFWNATYENCTAGALC 



HP TEKNSTA ^PADRLPCRHLFAGTELTDLAYGCILLAGSLLVLCGCLYLIVKLLN 

*. . . . ...*.*.**. *. *. ***** **** **. ***. **. ******* 

PN MGNLTWTMLNKTffllHQERCKHIFANTTLPDLAYGLILLALSLFVLCTCLILIVKLLN 
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HP SVLRGRVAQVVRTVIHADFPFPLGWLGGYLAVLAGAGLTFALQSSSVFTAAYVPLMGYGV 
*. *. *. ** . ***. *****. *. . **. *. . . ***. ** . *******. *. . **. *. ** 
PN SMLKGQVAWIKRVINTDFPFPFCPTGYIAIFVGAGMTFIYQSSSYFTSAITPLVGIGV 

HP ISLDRAYPLLLGSNIGTTTTALLAALASPADRMLSALQVALIHFFFNLAGILLWYLVPAL 

PN ISLERAmTLGSNIGTTnAILAAMASPAEKLKESLQIALCHFFFNVMGILLFlPIPFT 

HP RLPIPLARHPGVyTARYRWVAGVYLLlGFLLLPLAAFGLSLAGGMVLAAyGGPLYGLVLL 
*. **. *n . * **. *** **. **. * *». . **. . ****. ** **. . »* *. * *. . . 
PN RWMARGLGMrAm.WAGLYLVLCFLYFPLTYFGLSMAGWQVLVGYGVPFmiYF 

HP YILVTYLQRRRPAWLPYiaRSWAWLPYWLHSLEPWDRLYTRCCPCNYCSPPKATTKEAYC 

PN YIVVNYMQSRCPRFLPKVLQDTOFLPRPLHSMAPTOTYYrSALGFCGKYCCCCCKCCKKT 

HP YENPEILASQQL 

PN EDEimiKNNTKSLEMYDNPSEDEDmSKiVm 
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The search of the GenBank using the base sequences 
of the present cDNft. has revealed the registration of 
sequences that shared a homology of 90% or more (for example^ 
Accession No. M792826) among ESTs. However, since they are 
5 partial seqaenceSf it can not be judged whether or not they 
encode the same protein as the protein of the present 
invention . 

<HP03884> (SEQ ID NOS : 62, 12, and 82) 
Determination of the whole base sequence of the 

10 cDNA. insert of clone HP03884 obtained from cDNA library of 
human kidney revealed the structure consisting of a SSe-bp 
5' -untranslated region, a 246-bp ORF, and a 864-bp 3''- 
iintranslated region. The ORF encodes a protein consisting of 
81 amino acid residues and there existed one putative 

15 transmembrane domain. Figure 22 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte^ 
Doolittle method, of the present protein. In vitro 
translation resulted in formation of a translation product 
of 10 kDa that was almost identical with the molecular 

20 weight of 8,928 predicted from the ORF. 

The search of the protein database using the amino 
acid sequence of the present protein revealed that the 
protein was similar to rat cortexin (Accession No* P41237) . 
Table 13 shows the coirparison between amino acid sequences 

25 of the human protein of the present invention (HP) and rat 
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cortexin (RC) . Therein, the marks of *, and . represent a 
gap, an amino acid residue identical with that of the 
protein of the present invention, and an amino acid residue 
similar to that of the protein of the present invention, 
5 respectively. The both proteins shared a homology of 47.9% 
in the entire region. 

Table 13 

HP MDGGQPIPSSLmGNESADSSMSLEtMTFWVILLFIFLGILIVRCFR^ 

10 * . *. m *mm. . * 

RC MSAPmSPEPLPPSTGPPVGAGLDVE(ffiT¥FAFVLCLLmYLLMVRC¥RILL^^^ 

PTSTWADGLEGLEKGQFDHALA 
». *. *, * *. ** 
PASSWTDHKEALERGQFDYALV 

Furthermore, the search of the GenBank using the 
base sequences of the present cDNA has revealed the 
registration of sequences that shared a homology of 90% or 
20 more (for example, Accession No. AI791379) among ESTs. 
However, since they are partial sequences, it can not be 
judged whether or not they encode the same protein as the 
protein of the present invention . 

<HP03934> (SEQ ID NOS: 63, 73, and 83) 
25 Determination of the whole base sequence of the 



HP 

15 
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cJMA insert of clone HP03934 obtained from cDNA library of 
human kidney revealed the structure consisting of a 39-bp 
5' -untranslated region, a 1965-bp ORF, and a 463-bp 3'- 
untranslated region* The ORF encodes a protein consisting of 
5 654 amino acid residues and there existed a putative 
secretory signal at the N-terminus. Figure 23 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
Doolittle method, of the present protein. In vitro 
translation resulted in formation of a translation product 

10 of 80 kDa that was larger than the molecular weight of 
74,110 predicted from the ORF. Application of the (-3,-1) 
rule, a method for predicting the cleavage site of the 
secretory signal sequence, allows to expect that the mature 
protein starts from arginine at position 28. 

15 The search of the protein database using the amino 

acid sequence of the present protein revealed that the 
protein was similar to human p-galactosidase (Accession No. 
AAC12775) . Table 14 shows the comparison between amino acid 
sequences of the human protein of the present invention (HP) 

20 and human p-galactosidase (BG) - Therein, the marks of *, 
and . represent a gap, an amino acid residue identical with 
that of the protein of the present invention, and an amino 
acid residue similar to that of the protein of the present 
invention, respectively. The both proteins shared a homology 

25 of 54.6% in the entire region- 
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Table 14 

HP. MAPKKLSCLRSLLLPLSITLL LPQADTRSFWDRGHDRFLLDGAPFRYVSGSLHY 

BG MPGFLVRILLLLLVLLLLGPTRGL8MTQRMFBIDYSRDSFLKDGQPFRYISGSIHY 

HP FRWRVLWmLKMRWSGLNAIQFTOMMPQPGVYHFNGSRDLIAFLNEAALANLL 
H** * . ****»» ***»*. *♦* ** ». *. . . . *. **. * . ♦» 

BG SRWRFYMDRLLIQMAGLHAIQTYVPffNFHEPWPGQYQFSEDHDVETFLRLAHBLGLL. 

HP YILEPGPYICAEWEMGGLPSWLLRKPEIHLRTSDPDFLAAYDSWFKVLLPKIYPWLYHNG 
tt*u**********if***. ****.* **. »****. *. * ** 

BG VILRPGPYICMWEMGGLPAmEKESILLRSSDPDYLAAVDKWLGVLLPKMKPLLYQNG 

HP GNIISIQVENEYGSYRACDFSYMRHLAGLPSALLGEKILLFTTDGPE— GLKCGSLRGLY 

BG GPYITYQVENEYGSYFACDFDYLRPLQKRFRHHLGDDVVLFTTDGAHKTFLKCGALQGLY 

HP TTVDFGPADNMTKIFllLRKYEPHGPLYNSEYYTGEDYWGQMSTRSYSAVtKGLBNML 

BG TTVDFGTGSNITDAFLSQRKCEPKGPLINSEFYTGWLDHWGQPHSTIKTEAVASSLYDIL 

HP KLGASVMYMFHGGTNFGYWNGADKKGRFLPITTSYDYDAPISEAGDPTPKLFALRDYIS 
u* ***** ****♦*♦*. ***** * * **♦*. . » 

BG ARGASVNLYMFIGGTNFAYMGAN-SPYAAQPTSYDYDAPLSEAGDLTEKYFALRNIIQ 
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HP KFQEVPLGPLPPPSPKMMLGPVTLHLVGHLLAFLDLLCPRGPIHSILPMTFEAVKQDHGF 

BGPEEVPEGPIPPSTPKFAYGKVTLEEKTVGAALDILCPSGPIKSLTPLTFIQYKQHYGF 

HP MLYRramiFEPTPFWVPMGVBDMYVMVDGWQGVVERMRDKLFLTGKLGSKLDI 
. **** *. ♦ ********* . * . *. . **. 

BG YLYRTTLPQDCSNPAPLSSPLHGVHDRAYVAYDGIPQGYLERMVITLNITGKAGATLDL 
HP LVEMGRLSFGSNSSDFKGLLKPPILGQTILTQMMFPLKIDNLVK ^ff-W-FPLQ 

BGLVEHMGRVNYGAyiNDPKGLVSNLTLSSNILTDWTIFPLDTEDAVRSHLGGWGHRDSGHH 

HP LPKWPYPQAP-SGPTFYSKTFPILGSYGD ^TFLYLPGTOGQWINGFNLGRYWTKQ 

*. . . .*.**.*.♦...* *». . 

BGDBAWABNSSNYTLPAFYMGPSIPSGIPDLPQDTFIQFPGWTKGQVWINGFNLGRYWPAR 

HP GPQQTLYYPRFLLFPRGALNKITLLELE ^DVPLQPQYQFLDKPILNSTSTLHRTH 

*** **. . * *. **. ***» * * .*«.*.*...*.*.. 

BG GPQLTLFVPQHILMTSAP-NTimELEWAPCSSDDPELCAYTPVDRPYIGSSVTYDHPS 

HP INSLSADTLSASEPMELSGH 
BG KPYEKRLMPPPPQKNKDSWLDHY 
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The search of the GenBank using the base seqaences 
of the present cDm has revealed the registration of 
sequences that shared a homology of 90% or more (for example. 
Accession No. AI907720) among ESTs- However, since they are 
5 partial sequences, it can not be judged whether or not they 
encode the same protein as the protein of the present 
invention, 

<HP03949> (SEQ ID NOS: 64, 74, and 84) 
Determination of the whole base sequence of the 

10 cDNA insert of clone HP03949 obtained from cDNA library of 
human kidney revealed the structure consisting of a 244-bp 
5' -untranslated region, a 1173-bp ORF, and a 33-bp 3'- 
untranslated region. The ORF encodes a protein consisting of 
390 amino acid residues and there existed ten putative 

15 transmembrane domains. ' Figure 24 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
Doolittle method, of the present protein. In vitro 
translation resulted in formation of a translation product 
of high molecular weight- 

20 The search of the protein database using the amino 

acid sequence of the present protein revealed that the 
protein was similar to hman solute carrier family 16 
(Accession Wo. ISIM_004696) . Table 15 shows the coHiparison 
between amino acid sequences of the human protein of the 

25 present invention (HP) and human solute carrier family 16 



wo 01/49728 



PCT/JPOO/09359 



103 

(HS) . Therein, the marks of */ and . represent a gap, an 
amino acid residue identical with that of the protein of the 
present invention, and an amino acid residue similar to that 
of the protein of the present invention, respectively* The 
both proteins shared a homology of 98.7% in the region other 
than the N- terminal and C-terminal regions. 
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Table 15 

HP MGMDDCDSFFPGPLVAIICDIlGEKnSILGAFWTGGYLISSffATSIPFLCVTMGLL 

HS WIGSIMSSLPCAGPLVAI ICDILGEKnSILGAFYVTGGYLISSWATSIPFLCVTMGLL 

HP PGLGSAFLYQVAAVVTTnFKKRLALSTAIAIlSGMGLmLAPFTKFLIDLYDWTGA^ 

HS PGLGSAFLYQYAAYVrraFKmALSTAIARSGMGLTFLLAPFTKFLIDLYDfrGALIL 

HP FGAIALNLVPSSMLLRPIHIKSENNSGIKDKGSSLSAHGPEAHATETflCHETEESTIKDS 

HS FGAIALNLVPSSELRPIHIKSEMSGIKDKGSSLSAHGPBAHATETHCHETEESTIKDS 

HP nOKAGLPSKETVSQNQSEEFYNGPNRNRLLLKSDEESDKVISWSCKQLFDISLFRNPF 

^ HS nQKAGLPSKEmONQSEEFYNGPNRmLLKSDEESDKVISWSCKQLFDISLFIOjPF 

HP FYIPTWSFLLSQLAYFIPTFHLVARAKTLGIDIMDASYLVSYAGILETVSQI ISGWVADQ 

HS FYIFTWSFLLSQLAYFIPTFHLVARAKTLGIDIJIDASYLYSVAGILBTVSQI I SGWVADQ 

HP NWIKKYHYHKSYLILCGimiAPLATTFPLLMTYTICFAIFAGGYLALILPYLVDLCRN 

HS MIKmYHKSYLILCGITNLLAPLATTFPLLMTniCFAIFAGGYLALILPVLYDLCRN 

HP STYNRFLGLASFFAGMAYLSGPPIAGNTFTTF 

HS STVNSFLGLASFFAGMYLSGPPIAGTOYTQTYNGSFYFSGICYLLSSVSFFFVPLAE 
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The search of the GenBank using the base sequences 
of the present cDNA has revealed the registration of 
sequences that shared a homology of 90% or more (for example, 
Accession No. Mr239415) among ESTs. However, since they are 
5 partial sequences, it can not be judged whether or not they 
encode the same protein as the protein of the present 
invention* 

<HP03959> (SEQ ID NOS: 65, 75, and 85) 
Determination of the whole base sequence of the 

10 cDNA insert of clone HP03959 obtained from cDNA library of 
human kidney revealed the structure consisting of a 7-bp 5'- 
untranslated region, a 1359-bp ORF, . and a 531-bp 3'- 
untranslated region. The ORF encodes a protein consisting of 
452 amino acid residues and there existed a putative 

15 secretory signal at the N-tenainus. Figure 25 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
Doolittle method, of the present protein. In vitro 
translation resulted in formation of a translation product 
of 53 kDa that was somewhat larger than the molecular weight 

20 of 50,798 predicted from the ORF. In this case, the addition 
* of a microsome led to the formation of a product of 55 kDa. 
In addition, there exists in the amino acid sequence of this 
protein three sites at which N-glycosylation may occur (Asn- 
Phe-Ser at position 64, Asn-Gly-Ser at position 126 and Asn- 

25 Val-Thr at position 362). Application of the (-3,-1) rule, a 
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method for predicting the cleavage site of the secretory 
signal sequence, allows to expect that the mature protein 
starts from alanine at position 27. 

The search of the protein database using the amino 
acid sequence of the present protein revealed that the 
protein was similar to Arabidopsis thaliana putative 
carboxypeptidase (Accession No* AAD21510) . Table 16 shows 
the comparison between amino acid sequences of the human 
protein of the present invention (HP) and Arabidopsis 
thaliana putative carboxypeptidase (AC) . Therein, the marks 
of and . represent a gap, an amino acid residue 

identical with that of the protein of the present invention, 
and an amino acid residue similar to that of the protein of 
the present invention, respectively. The both proteins 
shared a homology of 44.3% in the region of 323 amino acid 
residues other than the N-tenuinal and C-terminal regions. 
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Table 16 

HP mALMSPmi.LLLPLLLGLNAGAVIDffPTEEGKEVl)YYTVBB3)AWWWLmTN 

AC MDPKLGDTSKLDQHTCFGGIIKV 

HP SCKNFSELPLmQGGPGGSSTGPGNFEEIGPLDSOLKPRKTTWLQAASLLFVDNPVGT 

AC HIELKILPSHGLSSSGSKGASGYGIGNFQEYGPLDTFLKPMSTWLKKADLLFVDSPYGA 

HP GFSYVNGS— GAYAKDLAMVASDMmLKTFFSCHKBFQTVPFYIFSESYGGKMAAGIGL 

.».*...**. *». . * *. . * . ******. ». . ** 

AC GYSFVEGNQPLYYKSDEEAAmTKLLQQLFNKNQTLNQSPLFIYAESYGGKIAVKLGL 

HP ELYKAIQRGTIKCNFAGYALGDSWISPYDSYLSWGPYLYSMSLLEDKGLAEYSKYAEQYL 

.. .*.*.*..*...** ******** * *. **** * .**.*.** **. . 

AC SVIDAYQSGKLKLHLGGYILGDSWISPEDFVFSffGPLLKHYSRLDDNGLDSSNSLAEKIK 

HP MYMKGLYREATELWGKAEMI lEQNTDGYNFYN-ILTKSTPTSTMESSLEFTQSHLY— 

**.**. * . * . *. ... *.***.* ** 

AC TQIKNGEYYGATQTWMDLENLISSKSNFVDFYNFLLDTGMDPYSLTTSLKIKKEEKIKKY 

HP — CLCQ-RHVRHLQR-DALSQLMNGPIRKKLKI IPEDQSWGGQATNYFYHMEEDFMKPY 

. » . * *. . **** *. *******. * ** **. . **. . ***** 

AC SRYLNDMSLSDYEDVEGDLDKLMGYIKKKLKI IPNDLIWGNNSDDYFTAMBAAFMKPY 

HP ISIVDELLEAGINYTVYNGQLDLIVDTMGQEAWVmKffPELPKFSQLKm 
». *****. . *. . **. ******. * . * * ***». **. 

AC lEDVDELLATGYDYTlYNGQLDYICSTSGTEAWVHKLR 
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The search of the GenBank using the base sequences 
of the present cDNZV has revealed the registration of 
sequences that shared a homology of 90% or more (for example. 
Accession No, T59065) among ESTs. However, since they are 
5 partial sequences, it can not be judged whether or not they 
encode the same protein as the protein of the present 
invention , 

<HP03983> (SEQ ID NOS: 66, 76, and 86) 
Detejnxiination of the whole base sequence of the 

10 cDNA insert of clone HP03983 obtained from cDNA library of 
human kidney revealed the structure consisting of a 42-bp 
5' -untranslated region, a 1473-bp ORF, and a 341-bp 3'- 
untranslated region. The ORF encodes a protein consisting of 
490 amino acid residues and there existed a putative 

15 secretory signal at the N- terminus and one putative 
transmembrane domain at the C-terminus- Figure 26 depicts 
the hydrophobicity/hydrophilicity profile, obtained by the 
Kyte-Doolittle method, of the present protein • Application 
of the ("3,-1) rule, a method for predicting the cleavage 

20 site of the secretory signal sequence, allows to expect that 
the matxire protein starts from glutaitdc acid at position 22 • 

The search of the protein database using the amino 
acid sequence of the present protein revealed that the 
protein was similar to human ClqR protein (Accession No. 

25 AAB53110) . Table 17 shows the comparison between amino acid 
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sequences of the human protein of the present invention (HP) 
and human ClqR protein (HC) . Therein, the marks of 
and * represent a gap, an amino acid residue identical with 
that of the protein of the present invention, and an amino 
5 acid residue similar to that of the protein of the present 
invention, respectively. The both proteins shared a homology 
of 25.8% in the N-terminal region of 310 amino acid residues. 
Since the positions of 17 cysteine residues are conserved, 
in particular, the two proteins are considered to assume 
10 similar higher-order structures. 
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Table 17 

HP MRPmCLLWQALWGPGGGEHPTADKAGCSASGACYSLHHATMKRQAAEEACILRGGA 
»....*** *.**.». * * 

HC MATSMGLLLLLLLLLTQPGAGTGADTEAWC-VGTACYTAHSGKLSAABAQNHCNQNGGN 

HP LSTVMGAELMVLALL-MGPGPGGGSKDLLFffVALEmSHCTLENEPLRGFSWLSS 
».**...*.*.* **..».*...* . **.%***. 

HC LATVKSKEEAQHVQRVLAQLLRSEAALTAEMSKFfflGLQREKGKCLDPSLPLKGFSWV— 

HP DPGGLESDTLQWVEEPQRSCTAKRC— AVLQATGGVEP — ^AGWKEMRC ^HLRAN 

HC -GGGEDTPYSNWHKELRNSCISKRCYSLLLDLSQPLLPNRLPKffSEGPCGSPGSPGSNIE 

HP GYLCKYQFEVLCPAPRPGAASNLSYRAPFQLHSAALDFSPPGTBVSALC ^RGQLPIS 

»..**. »..*.. * * ,m * * .... 

HC GFYCKFSPKGMCmALGGPGQYTYTTPFQnSSSLEAYPFASAANVACGEGDKDBTQSH 
HP -VTCIADEIGA-RmSGDVLCPCP-GRYLBAGKCAELPNCLD-DLGGFACECATGFE 
HC YFLCKEKAPDYFDWG-SSGPLCYSPKYGCNFNNGGCHQ--DCFBGGDGSFLCGCRPGFR 
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HP LGKDGRSCVTSGE«1PTL(KJTGWTMPATATSPYPQRTWPIRVDEKLGETPLVPEQDN 

HC LLDDLVTCASitNPCSSSPCRGGATCVLGPHGKNnCRCPQGYQLDSSQLDCVDVDECQDS 

HP SVTSIPEIPRWGSQSTMSTLQMSLQAESKATITPSGSVISKFNSnSSATPQAFDSSSAV 

HC PCAQECVNTPGGFRCBCPGYEPGGPGEGACQDVBECALGRSPCAQGCTNTDGSFHCSCE 

HP VFIFVSTAYWLYILimLGLmCFHESPSSQPRKESMGPPGLESDPEPAALGSSSAH 

HC EGYVLAGEDGrQCQDVDECVGPGGPLCDSLCFNTQGSFHCGCLPGWVLAPNGVSCTMGPV 

HP CTNNGVKVGDCDLRD8AEGALLAESPLGSSDA 

HC SLGPPSGPPDEEDKGEKEGSTVPRAATASPTRGPEGTPKATPTTSRPSLSSDAPITSAPL 
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The search of the GenBank using the base sequences 
of the present cDNA. has revealed the registration of 
sequences that shared a homology of 90% or more (for exampie. 
Accession No. R51653) among ESTs. However, since they are 
5 partial sequences, it can not be judged whether or not they 
encode the same protein as the protein of the present 
invention* 

<HP10745> (SEQ ID NOS: 67, 77, and 87) 
Determination of the whole base sequence of the 

10 cDNA insert of clone HP10745 obtained from cDNA library of 
human umbilical cord blood revealed the structure consisting 
of a 261-bp 5' -untranslated region, a 1179-bp ORF, and a 
733-bp 3' -untranslated region. The ORF encodes a protein 
consisting of 392 amino acid residues and there existed nine 

15 putative transmembrane domains. Figure 27 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte-- 
Doolittle method, of the present protein. 

The search of the GenBank using the base sequences 
of the present cDNA has revealed the registration of 

20 sequences that shared a homology of 90% or more (for example. 
Accession No. R59881) among ESTs, However, since they are 
partial sequences, it can not be judged whether or not they 
encode the same protein as the protein of the present 
invention. 

25 <HP10775> (SEQ ID NOS: 68, 78, and 88) 
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Determination of the whole base sequence of the 
cDNA insert of clone HP10775 obtained from cDNA library of 
human kidney revealed the structure consisting of a 30-bp 
5'' -untranslated region, a 1617-bp ORF, and a:> 287-bp 3'- 
untranslated region. The ORF encodes a protein consisting of 
538 amino acid residues and there existed a putative 
secretory signal at the N-terminus. Figure 28 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
Doolittle method, of the present protein. In vitro 
translation resulted in formation of a translation product 
of 66 kDa that was larger than the molecular weight of 
55,133 predicted from the ORF. Application of the ("3,-1) 
rule, a method for predicting the cleavage site of the 
secretory signal sequence, allows to expect that the mature 
protein starts from serine at position 23. 

The search of the GenBank using the base sequences 
of the present cDNA has revealed the registration of 
sequences that shared a homology of 90% or more (for example. 
Accession No. AA366320) among ESTs. However, since they are 
partial sequences, it can not be judged whether or not they 
encode the same protein as the protein of the present 
invention. 

<HP10782> (SEQ ID NOS: 69, 79, and 89) 
Determination of the whole base sequence of the 
cDNA insert of clone HP10782 obtained from cDNA library of 
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human kidney revealed the structure consisting of a 70-bp 
5' -untranslated region, a 309-bp ORF, and a 1501-bp 3'- 
untranslated region. The ORF encodes a protein consisting of 
102 amino acid residues and there existed three putative 
5 transmembrane domains. Figure 29 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
Doolittle method, of the present protein. 

The search of the GenBank using the base sequences 
of the present cDNA. has revealed the registration of 
10 sequences that shared a homology of 90% or more (for example. 
Accession No. AI815463) among ESTs. However, since they are 
partial sequences, it can not be judged whether or not they 
encode the same protein as the protein of the present 
invention . 

15 <HP10787> (SEQ ID NOS : 70, 80, and 90) 

Determination of the whole base sequence of the 
cDNA insert of clone HP10787 obtained from cDNA library of 
hutaan kidney revealed the structure consisting of a 54-bp 
5' -untranslated region, a 1329-bp ORF, and a 912-bp 3'- 

20 untranslated region. The ORF encodes a protein consisting of 
442 amino acid residues and there existed one putative 
transmembrane domain at the N-terminus. Figure 30 depicts 
the hydrophobicity/hydrophilicity profile, obtained by the 
Kyte-Doolittle method, of the present protein. In vitro 

25 translation resulted in formation of a translation product 
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of 50 kDa that was almost identical with the molecular 
weight of 50,562 predicted from the ORF. In this caser the 
addition of a microsome led to the formation of a product of 
56 kDa. In addition, there exists in the amino acid sequence 
5 of this protein four sites at which N-glycosylation may 
occur (Asn-Leu-Thr at position 83, Asn-Phe-Thr at position 
89, Asn-Ala-Ser at position 113 and Asn-Lys-Ser at position 
151) . 

The search of the protein database using the amino 
10 acid sequence of the present protein revealed that the 
protein was similar to rat PV-1 (Accession No. AM)41524) . 
Table 18 shows the comparison between amino acid sequences 
of the human protein of the present invention (HP) and rat 
PV-1 (RP) . Therein, the marks of *, and . represent a gap, 
15 an amino acid residue identical with that of the protein of 
the present invention, and an amino acid residue similar to 
that of the protein of the present invention, respectively. 
The both proteins shared a homology of 61.1% in the entire 
region. 
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Table 18 

HP MGLAMBHGGSYARAGGSSRGCWYYLRYFFLFVSLIQFLI ILGLVLFMVYGNVHYSTESNL 

8P MGLSMDR-SPYSRTGDRDRGCWmRYFFLFVSLIQFLIILGLVLFMIYGNVHAnESSL 

HP QAT^RRAEGLYSQLLGLTASQSNLmNFTTRABAIMQMNARRDLDRINASFRQCQ 
. **» **. . . *». *. *. **. *. **..*. **. .** *.. **. . . ********** 
SP RATEIRADNLYSQWGLSM(mSK(lLNISTLVH)TVMQQLLTTMEVERINASFRQCQ 

HP GDRYIYTNNQRYMAAI ILSEKQCRDQPKDMNKSCDALLFMLNQKVKTLEVEIAKEKTICT 
**..*.*.*.. . *. *. »*. *. ***». ». . *. . . ». 

RP GDLiniNYNRFIAAI ILSEKQCQEQLKEGNKTCEALLFKLGEKVKTLEMEWKEKAYCS 

HP KDKESmmVAEEQLVECVKmLQHWLWffiQLQKVQALCLPLDKDKFEMDLRNL 

RP PKDSLLAGKRQAEMQQEACGKMEQQKQDQQVreEQLRKVQSLCLPLDQElffQAD^^ 

HP WRBSI IFRSLDEGYNLYHPLGSELASIRRACDHMPSLMSSKVBELARSLRADIERVARE 
. **. ».».*..». **. *. ***. *» 

RP WRDSLraSLDNIGYH-Y-SLMPBFSSLRRTCESLPGlMTTKYEELARGLRAGIERTO 

HP HSDLQRmEAQQGLRASQEAKQKVEmiWREAKLQAECSRQTQLALEEKAVLRKERDN 

*. . *. u*n ** *♦**♦*. . *. . **. **. . **. 

RP NGELRRQEELERAIQGEREARtRAGTEAQARETQLRTECARQTQLALEEKAALRTQRDD 

HP LmEEKKREAE(lLRMELAIRNSALDTCIKTKSQPMMPVSRPMGPYPNPQPlDPASLEE 
*. **♦ **** *. . . * **»*♦*. *.***...* ** ********* 
RP LERQLEARKRELEQLRTEVDVRI SALDTCVKAKSLPAIQ-PRLPGPPPNPPPIDPASLEE 
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HP FKRKILESQRPPAGIPYAPSSG 
. ******** *- . *• ** 
8P FKKRILESQRPPLVNPAYPPSG 

Furthermore, the search of the GenBank using the 
base sequences of the present cDNA has revealed the 
registration of sequences that shared a homology of 90% or 
more (for example. Accession No. AL041217) among ESTs, 
However, since they are partial sequences, it can not be 
judged whether or not they encode the same protein as the 
protein of the present invention. 

<HP03977> (SEQ ID NOS : 91, 101, and 111) 
Determination of the whole base sequence of the 
cDNA insert of clone HP03977 obtained from cDNA library of 
human kidney revealed the structure consisting of a 35-bp 
5' -untranslated region, a 684-bp ORF, and a 1175-bp 3'- 
untranslated region. The ORF encodes a protein consisting of 
227 amino acid residues and there existed a putative 
secretory signal at the N-terminus and one putative 
transmembrane domain at the C- terminus. Figure 31 depicts 
the hydrophobicity/hydrophilicity profile, obtained by the 
Kyte-Doolittle method, of the present protein. In vitro 
translation resulted in formation of a translation product 
of 29 kDa that was larger than the molecular weight of 
25,926 predicted from the ORF- -Z^plication of the (-3,-1) 
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rule, a method for predicting the cleavage site of the 
secretory signal sequence, allows to expect that the mature 
protein starts from leucine at position 30. 

The search of the protein database using the amino 
5 acid sequence of the present protein revealed that the 
protein was similar to human gp25L2 (Accession No. 07^^62380) • 
Table 19 shows the comparison between amino acid sequences 
of the hiunan protein of the present invention (HP) and hioman 
gp25L2 (GP) . Therein, the marks of *, and . represent a 
10 gap, an amino acid residue identical with that of the 
protein of the present invention, and an amino acid residue 
similar to that of the protein of the present invention, 
respectively- The both proteins shared a homology of 78.5% 
in the region other than the N-terminal region. 



15 
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Table 19 . 

HP MGYGAGPLRAMGROALLLLALCATGAQGLYFHIGETEKRCFIEEIPDETMVIGNYRT(lM 

GP MRTLLLVLEATRGS-ALYFHIGETEKKCFIEEIPDETWIGNYRTQL 

HP TOQOWLPSTPGLGMVEVKDPDGKmSRQYGSEGliFTFTSHTPGLHQICLHSNSTR 

GP YDKQSEEYQPATPGFGMCVEVKDPBDKVILAREYGSEGRFTFTSHTPGEHQICLHSHSTK 

HP mFAGGKLRmDIQYGEHAMYPEIAAPKLTELQLmOLLDQVEQIQKEQDYQRYR 
. . i^^^itUUU****.*. **. **#**. ***. . * 

GP FSLFAGGMLRVHLDIQVGEHAHDYAEIPAKDKLSELQLRVRQLYEQVEQiOKEQNYQRffR 

HP EERFELTSESTNQRVLWSIAQTVILILTGIWQMRHLKSFPEAKKLV 
HH* ************** **. **. . *. **************** 

GP EERFRQTSESTNQRVLWSILQTLILVAIGVWQMRHLKSFFEAKKLV 
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The search of the GenBank using the base sequences 
of the present cDNA has revealed the registration of 
sequences that shared a homology of 90% or more (for example> 
Accession No, AR052481, U.S. Patent No. 5831052) in patent 
5 data* However, since they are partial sequences, it can not 
be judged whether or not they encode the same protein as the 
protein of the present invention. 

<HP10649> (SEQ ID NOS: 92, 102, and 112) 
Determination of the whole base sequence of the 

10 cDNPl insert of clone HP10649 obtained from cDNA library of 
human epidermoid carcinoma cell line KB revealed the 
structure consisting of a 114-bp 5' -untranslated region, a 
1059"bp ORF, and a 1240-bp 3' -untranslated region. The ORF 
encodes a protein consisting of 352 amino acid residues and 

15 there existed one putative transmembrane domain. Figure 32 
depicts the hydrophobicity/hydrophilicity profile, obtained 
by the Kyte^Doolittle method, of the present protein. In 
vitro translation resulted in fonaation of a translation 
product of 40 kDa that was almost identical with the 

20 molecular weight of 39,774 predicted from the ORF. 

The search of the protein database using the amino 
acid sequence of the present protein revealed that the 
protein was similar to Epiphyas postvittana 
nucleopolyhedrovirus apoptosis inhibitor iap-1 (Accession 

25 No. AAD19698) . Table 20 shows the comparison between amino 
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Table 20 



HP MESGGRPSLC(ff ILLGTOYVTmYSmOKARYSQELKGAKPHLGEDLKSILSEAPG 



HP KCVPYAVIEGAmVKEllNSQFYENCKGVIQM,TLQEHKMVWNRimWNDCSKI IHQR 

EP MSATSPLYI INVCBNAHEYSAEPFHYLIERHNSFEHYPIDHYAFVNSLI INGF 
HP TNTWFDLWHEDGVDVAVRVLmDSVDLGLETVYEKFHPSIQSFTOVIfflYISGESPK 

EP RYQNVDDAmEYCSAVIKNMDDCWVHATLSPYCVYAMIAQNENFANNLSTNAFL 



HP GIQETEEMLKVGATLTGVGELYiDMSmOPPKQflQYlilSSQDFDSLLQRQESSYRLW 
EP YTPGKPICVYSRLTHTNARKSTFEDYWFAALQHLYANISEAGMFHmGDETACFFCDCR 
HP KVLALVFGFATCATLFFILRKQYLQRQBHLRLKQMQEEFQEHEAQLLSRAKPEDRESLKS 



EP VKDffLPNDDPWQRHAIANPQCYFVVCIKGDEFCNAVRmLAPLQSVVALEHVSllDEM 



HP ACWCLSSFKSCYFLECGHVCSCTECYRALPEPKKCPIGRQAITRVIPLYNS 

*.»*....*.**»**.*»** . ***. **». . * ... 
EP ECKI CLBRQRDTVLLPCSHFCYCMQCYFAL-DMCPTCRQDVTDFVKIFW 
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acid sequences of the hiiman protein of the present invention 
(HP) and Epiphyas postvittana nucleopolyhedrovirus 
apoptosis inhibitor iap-1 (EP) . Therein, the marks of *, 
and . represent a gap, an amino acid residue identical with 
that of the protein of the present invention, and an amino 
acid residue similar to that of the protein of the present 
invention, respectively. The both proteins shared a homology 
of 40.8% in the C-terminal region of 49 amino acid residues. 
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Furthermore, the search of the GenBank using the 
base sequences of the present cDNA has revealed the 
registration of sequences that shared a homology of 90% or 
more (for example, Accession No. T50032) among ESTs. However, 
5 since they are partial sequences, it can not be judged 
whether or not they encode the same protein as the protein 
of the present invention. 

<HP10779> (SEQ ID NOS: 93, 103, and 113) 
Determination of the whole base sequence of the 

10 cDNA. insert of clone HP10779 obtained from cDNA library of 
human kidney revealed the structure consisting of a 34-bp 
5' -untranslated region, a 393-bp ORF, and a 1949-bp 3'- 
untranslated region. The ORF encodes a protein consisting of 
130 amino acid residues and there existed two putative 

15 transmembrane doiaains* Figure 33 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
Doolittle method, of the present protein. 

Furthermore, the search of the GenBank using the 
base sequences of the present cDNA has revealed the 

20 registration of sequences that shared a homology of 90% or 
more (for example. Accession No. AL042495) among ESTs. 
However, since they are partial sequences, it can not be 
judged whether or not they encode the same protein as the 
protein of the present invention. In addition, this gene was 

25 mapped on chromosome 9q34 (Accession No." AC001644) . 
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<HP10790> (SEQ ID NOS: 94, 104, and 114) 
Determination of the whole base sequence of the 
cDNA insert of clone HP10790 obtained from cDNA library of 
human kidney revealed the structure consisting of a 109-bp 
5 5' -untranslated region, a 993-bp ORF, and a 53-bp 3'- 
untranslated region. The ORF encodes a protein consisting of 
330 amino acid residues and there existed one putative 
transmembrane domain. Figure 34 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 

10 Doolittle method, of the present protein. In vitro 
translation resulted in formation of a translation product 
of 34 kDa that was smaller . than the molecular weight of 
36,642 predicted from the ORF. 

The search of the GenBank using the base sequences 

15 of the present cDNA has revealed the registration of 
sequences that shared a homology of 90% or more (for example. 
Accession No. AW241940) among ESTs, However, since they are 
partial sequences, it can not be judged whether or not they 
encode the same protein as the protein of the present 

20 invention. 

<HP10793> (SEQ ID NOS: 95, 105, and 115) 
Determination of the whole base sequence of the 
cDNA insert of clone HP10793 obtained from cDNA library of 
human kidney revealed the structure consisting of a 70-bp 
25 5' -untranslated region, a 1053-bp ORF, and a 206-bp 3'- 
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untranslated region. The ORF encodes a protein consisting of 
350 amino acid residues and there existed a putative 
secretory signal at the N-tenainus and one putative 
transmembrane domain in the inner portion* Figure 35 depicts 
5 the hydrophobicity/h^drophilicity profile^ obtained by the 
Kyte-Doolittle method/ of the present protein. In vitro 
translation resulted in formation of a translation product 
of 40 kDa that was somewhat larger than the molecular weight 
of 37,134 predicted from the ORF, implication of the (-3,-1) 

10 rule, a method for predicting the cleavage site of the 
secretory signal sequence, allows to expect that the mature 
protein starts from glycine at position 25. 

The search of the GenBank using the base sequences 
of the present cDNA has revealed the registration of 

15 seqaences that shared a homology of 90% or more (for example. 
Accession No. AA326569) among ESTs. However, since they are 
partial sequences, it can not be judged whether or not they 
encode the same protein as the protein of the present 
invention. 

20 <HP10794> {SEQ ID NOS: 96, 106, and 116) 

Determination of the whole base sequence of the 
cDNA insert of clone HP10794 obtained from cDNA library of 
human kidney revealed the structure consisting of a 146--bp 
5' -untranslated region, a 342-bp ORF, and a 899-bp 3'- 

25 untranslated region. The ORF encodes a protein consisting of 
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113 amino acid residues and there existed one putative 
transmembrane domain. Figure 36 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
Doolittle method, of the present protein. In vitro 
5 translation resulted in foinaation of a translation product 
of 14 kDa that was almost identical with the molecular 
weight of 12,017 predicted from the ORF. 

The search of the GenBank using the base sequences 
of the present cDNA has revealed the registration of 
10 sequences that shared a homology of 90% or more (for example. 
Accession No. AI346561) among ESTs. However, since they are 
partial sequences, it can not be judged whether or not they 
encode the same protein as the protein of the present 
invention. 

15 <HP10797> (SEQ ID NOS : 97, 107, and 117) 

Determination of the whole base sequence of the 
cDNA insert of clone HP10797 obtained from cDNA library of 
h\man kidney revealed the structure consisting of a 129-bp 
5' -untranslated region, a 570-bp ORF, and a 459'bp 3'- 

20 untranslated region. The ORF encodes a protein consisting of 
189 amino acid residues and there existed a putative 
secretory signal at the N-terminus. Figure 37 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
Doolittle method, of the present protein. In vitro 

25 translation resulted in formation of a translation product 
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of 22 kDa that was almost identical with the molecular 
weight of 21,053 predicted from the ORF. Application of the 
(-3,-1) rule, a method for predicting the cleavage site of 
the secretory signal sequence, allows to expect that the 
mature protein starts from glutamine at position 23. 

The search of the GenBank using the base sequences 
of the present cDNA has revealed the registration of 
sequences that shared a homology of 90% or more (for example. 
Accession No. AA356938) among ESTs. However, since they are 
partial sequences, it can not be judged whether or not they 
encode the same protein as the protein of the present 
invention. In addition, this gene was mapped on chromosome 4 
(Accession No. AC004067) . 

<HP10798> (SEQ ID NOS: 98, 108, and 118) 
Determination of the whole base sequence of the 
cDNA insert of clone HP107 98 ^ obtained from cDNA library of 
human kidney revealed the structure consisting of a 25-bp 
5' -untranslated region, a 834'-bp ORF, and a 247-bp 3'- 
untranslated region. The ORF encodes a protein consisting of 
277 amino acid residues and there existed seven putative 
transmembrane domains. Figure 38 depicts . the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
Doolittle method, of the present protein. In vitro 
translation resulted in formation of a translation product 
of 27 kDa that was smaller than the molecular weight of 
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30,685 predicted from the ORF. 

The search of the GenBank using the base sequences 
of the present cDNA has revealed the registration of 
sequences that shared a homology of 90% or more (for example, 
5 Accession No. H92084) among ESTs. However, since they are 
partial sequences, it can not be judged whether or not they 
encode the same protein as the protein of the present 
invention , 

<HP10800> (SEQ ID NOS: 99, 109, and 119) 

10 Determination of the whole base sequence of the 

cDm insert of clone HP10800 obtained from cDNA library of 
. hioman kidney revealed the structure consisting of a 158-bp 
5' -untranslated region, a 825-bp ORF, and a 924-bp 3'- 
untranslated region. The ORF encodes a protein consisting of 

15 274 amino acid residues and there existed one putative 
transmembrane domain. Figure 39 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
Doolittle method, of the present protein. In vitro 
translation resulted in formation of a translation product 

20 of 33 kDa that was somewhat larger than the molecular weight 
of 31,108 predicted from the ORF. In this case, the addition 
of a microsome led to the formation of a product of 45 kDa, 
In addition, there exists in the amino acid sequence of this 
protein five sites at which N-glycosylation may occur (Asn- 

25 Ile-Thr at position 145, Asn-Ile-Thr at position 151, Asn- 
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Ile-Thr at position 164, Asn-^Ile-Thr at position 183, and 
Asn-Thr-Thr at position 256) . 

The search of the GenBank using the base sequences 
of the present cDNA has revealed the registration of 
5 sequences that shared a homology of 90% or more (for example. 
Accession No. AA729308) among ESTs. However, since they are 
partial sequences, it can not be judged whether or not they 
encode the same protein as the protein of the present 
invention . 

10 <HP10801> (SEQ ID NOS : 100, 110, and 120) 

Deterraination of the whole base sequence of the 
cDNA insert of clone HP1G801 obtained from cDNA library of 
human kidney revealed the structure consisting of a 133--bp 
5' -untranslated region, a 1173-bp ORF, and a 510-bp 3'- 

15 untranslated region. The ORF encodes a protein consisting of 
390 amino acid residues and there existed a putative 
secretory signal at the N-terminus and one putative 
transmembrane domain in the inner portion. Figure 40 depicts 
the hydrophobicity/hydrophilicity profile, obtained by the 

20 Kyte-Doolittle method, of the present protein. In vitro 
translation with the addition of microsome resulted in 
formation of a product of 50 kDa that was larger than the 
molecular weight of 41,097 predicted from the ORF. In 
addition, there exists in the amino acid sequence of this 

25 protein five sites at which N-glycosylation may occur {Asn- 
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Leu-Ser at position 108, Asn-Val-Thr at position 169, Asn- 
Leu-Ser at position 213, Asn-Val-Thr at position 236 and 
Asn-Gly-Thr at position 307). Application of the (-3,-1). 
rule, a method for predicting the cleavage site of the 
5 secretory signal sequence, allows to expect that the mature 
protein starts from glutamine at position 30, 

The search of the protein database using the amino 
acid sequence of the present protein revealed that the 
protein was similar to human A33 antigen (Accession No. 

10 NP 005805) . Table 21 shows the comparison between amino acid 
sequences of the huiaan protein of the present invention (HP) 
and human A33 antigen (HA) . Therein, the marks of -, and . 
represent a gap, an amino acid residue identical with that 
of the protein of the present invention, and an amino acid 

15 residue similar to that of the protein of the present 
invention, respectively. The both proteins shared a homology 
of 28.7% in the intermediate region of 265 amino acod 
residues. 
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Table 21 

HP MI SLPGPLVTNLLmFLGLSALAPPSMMffl-PAMLQAVEGGEyVLPAWY-TLHGE 

HA MVGKMffPVLWTLCAYRVTVDAISVETPQDVLSASQGKSYTLPCTYHTSTSS 

HP VSSSQPWEVPFVMWFFKQKEffiDQVLSYINGVTTSKPGVSLVYSMPSRNLSLRLBGLQEK 

...*... . .**.*.***.. ..*...* 

HA IffiGLIQWDKLLLTHTBRVVIWPFSNKNYIHG-ELYKNRVSISNNAEQSDASITIDQLTMA 

HP DSGPTSCSVNVQDKQGKSRGHSIKTLELNVLVPPAPPSCRLQGVPHVGANVTLSCQSPRS 
*. *. *. *»*. . . . »****. *.*..*,*.*. *. *** . 

HA DNGTYECSVSL — MSDLEGNTKSRVRLLVLVPPSKPECGIEGETI IGNNIQLTCQSKEG 

HP KPAVQYQTORQLPSFQTFFAPALDYIRGSLSLTNLSSSMAGYYVCKAHNEVGTAQCNVTL 
.*. *»».* . . . * . ..**.*.*...**.*..**»*. **. *. 
HA SPTPQYSWKR-YNILNQEQPLA(JPASGQPVSLKNISTDTSGYYICTSSNEEGTQFCNITV 

HP EV-STGPGAAWAGAWGTLVGLGLLAGLVLLYHCRGKALBEPANDIKEDAIAPRTIPWP 

.**.. **** **»* . » 

HA AVRSPSMNYALYVGIAVGWAALI I IGI 1 1 YCCCCRGK — ^DDHTEDKEDARPNBEAYEB 

HP KSSDTISKNGTLSSVTSARALRPPHGPPRPGALTPTPSLSSQALPSPSLPnDGAHPQPI 
HA PPEQLRELSREREEEDDYRQEEQRSTGSESPDHLDQ 
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Furthermore, the search of the GenBank using the 
base sequences of the present cDNA has revealed the 
registration of sequences that shared a homology of 90% or 
more (for example. Accession No- R33685) among ESTs, However, 
5 - since they are partial sequences, it can not be judged 
whether or not they encode the same protein as the protein 
of the present invention. 

<HP03696> (SEQ ID NOS: 121, 131, and 141) 
Determination of the whole base sequence of the 
10 cDNA insert of clone HP03696 obtained from cDNA library of 
human umbilical cord blood revealed the structure consisting 
of a 184-bp 5' -tmtranslated region, a 1188^bp ORF, and a 
589-bp 3' -untranslated region. The ORF encodes a protein 
consisting of 395 amino acid residues and there existed one 
15 putative transmembrane domain. Figure 41 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
Doolittle method, of the present protein. 

The search of the protein database using the amino 
acid sequence of the present protein revealed that the 
20 protein was similar to rat cell surface glycoprotein GP42 
(Accession No. P23505) . Table 22 shows the comparison 
between amino acid sequences of the hioman protein of the 
present invention (HP) and rat cell surface glycoprotein 
GP42 (RC) . Therein, the marks of -/ and . represent a gap, 
25 an amino acid residue identical with that of the protein of 
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the present invention, and an amino acid residue similar to 
that of the protein of the present invention, respectively. 
The both proteins shared a. homology of 46.1% in the 
intermediate region of amino acid residues 62-280. 
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Table 22 

HP MSMYmSGmQMIPSFKBHQTLSMGAATVQSRGQYSCSGQVMYIPQTFTQTSB 

RC MLLMVLLLC 

HP TAMVQVQELFPPPYLSAIPSPEPREGSLVTLRCQmHPLRSALRLLFSFHKDGHTLQDR 
. *. . . *♦**. »*** . *.»... . *.»»*,.»..* u. *..*,* 

RC VSMTEAQELFQDPVLSRLNSSETSD — ^LLLKCnKVDPNKPASELFYSFYKDMI IQHR 

HP GPHPELCIPGAKEGDSGLTWCEYAPEGGQVQKQSPQLEVRVQAPySRPVLTLHHGPADPA 
. . . * . *..*.*.. **** **....*.**.* * *». *♦***. *. . . . * 

RC SHNPLFFISEANEENSGLYQCVVDAeGTIQKKSDYLDIDLCTSySQPVLTLQHEATNLA 

HP VGDMVQLLCBAQRGSPPIlYSFYlDEKIVGNHSAPCGGTTSLLFPVKSEeDAGNYSCBAB 

RC EGDKVKFLCETQLGSLPILYSFYMDGBILGEPLAPSGRAASLLISVKAEWSGKNYSCQAE 

HP NSVSRERSEPKKLSLKGSQVLFTPASMYPWLPAS-LLGLMVIAAALLVYVRSWRKAGP 
*. . ♦ * * *. * 

RC NKVSRDISEPKKFPLYVSGTASMSTT-VYIWLPVSCLVGWPWLLRF 
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Furthermore, the search of the GenBank using the 
base sequences of the present cDNA has revealed the 
registration of sequences that shared a homology of 90% or 
more (for example. Accession No. AA446524) among ESTs. 
However, since they are partial sequences, it can not be 
judged whether or not they encode the same protein as the 
protein of the present invention, 

<HP03882> (SEQ ID NOS: 122, 132^ and 142} 

Determination of the whole base sequence of the 
cDNA insert of clone HP03882 obtained from cDNA library of 
human kidney revealed the structure consisting of a 57-bp 
5' -untranslated region, a 1653-bp -ORF, and a 484-bp 3'- 
untranslated region. The ORF encodes a protein consisting of 
550 aioino acid residues and there existed ten putative 
transmembrane domains. Figure 42 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
Doolittle method, of the present protein. In vitro 
translation resulted in formation of a translation product 
of high molecular weight. 

The search of the protein database using the amino 
acid sequence of the present protein revealed that the 
protein was similar to mouse solute carrier family 22 
{cation transporter) -like protein (Accession No. NP_033229) . 
Table 23 shows the comparison between amino acid sequences 
of the human protein of the present invention (HP) and mouse 
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solute carrier family 22 (cation transporter) -like protein 
(MS) . Therein, the marks of *r and - represent a gap, an 
amino acid residue identical with that of the protein of the 
present invention, and an amino acid residue similar to that 
5 of the protein of the present invention, respectively. The 
both proteins shared a hpmology of 48,9% in the entire 
region. 
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Table 23 

HP MAFSKLLEQAlXiVGLFQTLQVLTFILPCLMIPSmENFSMIPGMCWTllJLDN— G 

MS MAFPELLDRVGGLGRFQLFQmVTPILWYnONMLENFSAAWHHRCWVPLLDNSTSQ 

HP SAVSTMTPKALLTISIPPGPNQGPHQCKRFRQPQWQLLDPMTATSVSEADTEPCVDGW 
*..**.. »****♦. » **** ********* . . *****. **. *, **f^ i^ii 

MS ASIPGDLGPDVLLAYSIPPGPDQQPHQCLRFRQPQWQLTESNATATHWSDAATEPCEDGW 

HP YYDRSVFTSTIVAKWDLVCSSQGLKPLSQSIFMSGILYGSFIWaLSYBFGRKPMLSWCC 
***. *. * ****. . **»**. **. *. *. . ***». . *****. . * * **♦*. ..*.*.. 

MS YYDHSTFRSTIYTTWDLVCNSQALBPMAQSIFLAGILYGAAVCGHASDRFGSERVLTWSY 

HP LQLAVAGTSTIFAPTFVIYCCLRFVAAFGMAGIFLSSLTLMVEmrSRRAYTMTWGCA 

*..*.**.. ****.**.*». * . . ** *. . **♦ **. . . 

MS LLVSVSGTAAAFMPTFPLYCLFRFLLASAYAGVMMNTASLLMEffTSAQGSPLWTLNALG 

HP FSAGQAALGGLAFALRDWRTLQLAASYPFFAISLISmPESARWLI IKGKPDQALQELR 
** **. *..*...». ** ***♦. *. »** . . ************. ** **. ****. 

MS FSFGQYLTGSYAYGVRSWRMLQLAYSAPPFLFFYYSWWLPBSARffLITYGKLDQGLQELQ 

HP KYARIHGHK-EAKHLTIEYLMSSYKEEYASAKEPRSYLDLFCYPYLRWRSCAMLYVHFSL 
.**.*.**...*». ***. *...**...*. *..*..****. . . *. . 
MS RVAAVNRRKAEGDTLTMEYLRSAMEEEPSRDKAGASLGTLLHTPGLRHRTI ISMLCffFAF 

HP LISYYGLVFDLQSLCRDIFLIQALFGAYDFLGRATTALLLSFLGRRTIQAGSQAMAGLAI 
...***..***.**..»***♦♦».**** .... **.***** * ** * 

MS GFTFYGLALDLQALGSNIFLLQALIGIVDFPYKTGSLLLISRLGRRLCQYSFLYLPGLCI 
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HP LAmVPQDLQTLRWAYLGKGCFGISLTCLTIYKAELFPTPVIMTADGim 

*. *. *»*. . . . ** . . **** * . . **. **. . . ***** . **** * * ** 

MS LSNILYPHGMGVLRSALAVLGLGCLGGAFTCITIFSSELFPTVIRMTAYG 

^ HP MGPLILMSRQALPLLPPLLYGVISIASSLmFFLPBTQGLPLPDTim^ 

*.*»*. , . *.*,.*.****..*********...*. . . 

MS MLGPLmLGmsmLnGVVPVLSGLAAL-LLPETmPLPDTIQDI^^ 

HP GNRQEAVTVESTSL 
**.* 

10 

MS HDTPDGSILMSTRL 

The search of the GenBank using the base sequences 
of the present cDNA has revealed the registration of 
sequences that shared a homology of 90% or more (for example, 
15 Accession No, AI242210) among ESTs. However, since they are 
partial sequences, it can not be judged whether or not they 
encode the same protein as the protein of the present 
invention. 

<HP03903> (SEQ ID NOS : 123, 133, and 143) 
20 Detemination of the whole base sequence of the 

cDNA insert of clone HP03903 obtained from cDNA library of 
human kidney revealed the structure consisting of a 108-bp 
5' -untranslated region, a 657-bp ORF, and a 1988-bp 3'- 
untranslated region. The ORF encodes a protein consisting of 
25 218 amino acid residues and there existed three putative 
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transmembrane domains. Figure 43 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
Doolittle method, of the present protein. In vitro 
translation resulted in foarmation of a translation product 
of 26 kDa that was somewhat larger than the molecular weight 
of 23,487 predicted from the ORF. 

The search of the protein database using the amino 
acid sequence of the present protein revealed that the 
protein was similar to mouse prominin (Accession No. 
NP 032961) , Table 24 shows the coit^arison between amino acid 
sequences of the human protein of the present invention (HP) 
and mouse prominin (MP), Therein, the marks of and . 

represent a gap, an amino acid residue identical with that 
of the protein of the present invention, and an amino acid 
residue similar to that of the protein of the present 
invention, respectively. The both proteins shared a homology 
of 27.6% in the region other than the N--terminal and C- 
terminal regions. 
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Table 24 

HP MKffrLJU.LmLGLGLaALS(U,MGATDCKFLGPAEHLTFTPAAmWLAPRVRAPG 

.* ..: * *.* * 

MP MALVFSmLLGLCGKISSEG{ffAFJDn"PGAMNYELPT-TKYETeDTFMGIV 
DSLYGTVRKFLSWQLNPFPSELmL-NELA-SVKYNEYVRYEAGYVVCAVIAGLYL 
MP GPLTKMYHIFLMQPl©FPLDLmiQNKNFDISVDSKEIALYEIGVLICAILGLLFI 
HP LLVPTAGLCFCCCRCHRRCGGRVKTEffi-ALACBRAALMVFLLimLLLIGWCAPm 
MP ILMPLVGCFFCMCRCCNKCGGEMHQRQKQHAPCRBKCLGLSLLVICLLMSLGI lYGFVAH 
HP QRTHEQMGPSIEAMPETLLSLVGLYSDVPQVSTVTPHPHVPL 

t t ± ± 

MP QQTRraiKGTQKLAKSNFRDFQTLLTETPKQIDYVVEQYTNTKNKAFSDLDGIGSYLGGR 
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The search of the GenBank using the base sequences 
of the present cDNA has revealed the registration of 
sequences that shared a homology of 90% or more (for example. 
Accession No. AI792608) among ESTs. However, since they are 
5 partial sequences, it can not be judged whether or not they 
encode the same protein as the protein of the present 
invention . 

<HP03974> (SEQ ID NOS: 124, 134, and 144) 
Determination of the whole base sequence of the 

10 cDNA insert of clone HP03974 obtained from cDNA library of 
human kidney revealed the structure consisting of a 41-bp 
5' -untranslated region, a 1791-bp ORF, and a 253-bp 3'- 
untranslated region. The ORF encodes a protein consisting of 
596 amino acid residues and there existed twelve putative 

15 transmembrane domains. Figure 44 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
Doolittle method, of the present protein. In vitro 
translation resulted in formation of a translation product 
of high molecular weight. 

20 The search of the protein database using the amino 

acid sequence of the present protein revealed that the 
protein was similar to rabbit {Oryctolagus cuniculus) 
sodium/glucose cotransporter protein (Accession No. 
AAA66065) . Table 25 shows the comparison between amino acid 

25 sequences of the human protein of the present invention (HP) 
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and rabbit sodium/ glucose cotransporter protein (OC) . 
Therein, the marks of *, and . represent a gap, an amino 
acid residue identical with that of the protein of the 
present invention, and an amino acid residue siiailar to that 
5 of the protein of the present invention, respectively. The 
both proteins shared a homology of 89.1% in the entire 
region . 



wo 01/49728 



PCT/JPOO/09359 



143 

Table 25 

HP M-AANSTSDLHTPGTQLSVADI IVITVYFALNVAVGIWSSCRASRNTYNGYFLAGRDMTff 
* *. ***** *. **. ****. **. *************************, *********** 

OC MVADNSTSDPHAPGPQLSVTDiniTYYFALmYGIWSSCRASfiOTVSGYFLAG^ 

HP WPIGASLFASSEGSGLFIGLAGSGAAGGLAVAGFBWNATYVLLALAWVFVPIYISSBIVT 
********^ *************************^ ************** . ********* 

OC WPIGASLFGSSEGSGLFIGLAGSGAAGGLAVAGFDWNATYVLLALAWVFGAIYISSEIVT 

HP LPEYIQKRYGGQRIRmSYLSLLLSVFTKISLDLYAGALFVHICLGWNFYLSTILTLGI 
*. *************************************************, * 

OC LAEYIQKRFGGQRIRMYLSVLSLLLSVFTKISLDLYAGALFVHI CLGWNFYLSTIITLTI 

HP TALYTIAGGLAAVIYTDALQTLIMWGAVILTIKAFDQIGGYGQLEAAYAQAIPSRTIAN 
******, ********************_ ****, **_ ****, *****, ******, ** 

OC TALYTITGGLVAYIYTDALQTLIMWGAVILAIKAFHQIDGYGQMEAAYARAIPSmAN 
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HP nCHLPRTDiUeiFfiDPHTGDLPWOTGLTMTWyWCTDQVIVQRSLSmNHm 
tu****. *********. ****** 
OC . nCHLPRJ«)AMHMFSDPTODLPWTGMTFGLTIMATWTWCTT)QVIVQRSLSmNHAKA 

HP GSILASYLKMLPMGLI IMPGMISRALFPDDVGCWPSBCLRACCAEVGCSKIAYPKLVME 

**ttt**********. *************, ****************, **tif*lfU***** 

OC GSILASYLKMLPMGLMIMPGMISMLFPDEVGCWPSECLRACGAEIGCSNIAYPKLVMB 

HP LMPIGLRGLMIAmAALMSSLTSIFNSSSTLFTMDITOLRPRSGERELLLYGEYIVA 
tttm^tu. . ******. ******************** . . 

OC LMPVGLRGLMIAWALMSSLSSIFNSSSTLFTMDITOLRPCASEMLLYGEYIW 

HP LIGVSYAWIPYLQDSNSGQLFIYMQSVTSSLAPPVTAVFYLGVFffRRANEQGAFirGLIAG 
*tt**********. **. **********************. **. **. ******n***. ** 

OC LIGVSVAfflPVLQGSNGGQLFIYMQSVTSSlAPPVTAYFTLGIFffQRANEQGAFWGLLAG 

HP LVVGATRLYLEFLNPAPPCGEPDTRPAVLGSIHYLHFAYALFALSGAYYYAGSLLTPPPQ 
*. ***i^uttttt, ******. . *******. . **********, *. ***, *, 

OC LAYGATRLYLEFLHPAPPCGAADTRPAYLSQLHYLHFAYALFYLTGAYAYGGSLLTPPPR 
HP SVQIENLTWWTLAQDYPLGTKAGDGQTPQKHAFWARVCGFMILLMCVKIFFYAYFA 

. *******ifi^^^ . , tit*U****. , . ************************* 

OC RHQIENLTffmTRDLSLGAKAGDGQTPQRYTFffARVCGFHAILLMCYNIFFyAYFA 
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The search of the GenBank using the base sequences 
of the present cDNA has revealed the registration of 
sequences that shared a homology of 90% or more (for example. 
Accession No. AI793336) among ESUs. However, since they are 
5 partial sequences, it can not be judged whether or not they 
encode the same protein as the protein of the present 
invention * 

<HP03978> {SEQ ID NOS: 125, 135, and 145) 
Determination of the whole base sequence of the 

10 cDNA insert of clone HP03978 obtained from cDNA library of 
himian kidney revealed the structure consisting of a 99-bp 
5' -untranslated region, a 1404-bp ORF, and a 705-bp 3'- 
untranslated region. The ORF encodes a protein consisting of 
467 amino acid residues and there existed a putative 

15 secretory signal at the N-terminus. Figure 45 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
Doolittle method, of the present protein. In vitro 
translation resulted in formation of a translation product 
of 55 kDa that was somewhat larger than the molecular weight 

20 of 52,352 predicted from the ORF. In this case, the addition 
of a microsome led to the formation of a product of 57 kDa. 
In addition, there exists in the amino acid sequence of this 
protein two sites at which N-glycosylation may occur (Asn- 
Arg-Thr at position 78 and Asn-His-Ser at position 161) * 

25 Application of the (-3,-1) rule, a method for predicting the 
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cleavage site of the secretory signal sequence, allows to 
expect that the mature protein starts from alanine at 
position 22. 

The search of the protein database using the amino 
acid sequence of the present protein revealed that the 
protein was similar to human tubulo-interstitial nephritis 
antigen (Accession No. RZ\A84949) . Table 26 shows the 
comparison between amino acid sequences of the human protein 
of the present invention (HP) and human tubulo-interstitial 
nephritis antigen (TA) . Therein, the marks of and . 

represent a gap, an amino acid residue identical with that 
of the protein of the present invention, and an amino acid 
residue similar to that of the protein of the present 
invention, respectively. The both proteins shared a homology 
of 50,0% in the region other than the N-terminal region. 
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Table 26 , 

HP MWRCPLGLLLLLPLAGHLALGAQQGRGRRELAPGLHLRGISDAGGRYCQEQ 

TA MWTGIKILIFSYLnEITOKQUSQREmEAYFTRNHTVLQGTRFKRAIFQGQYCRNF 

HP DLCCEGEADDCALP-YLG-AICYCDLPCNRTVSDCCPDFWDPC — ^LGVPPPFPP-IQG 
.»*.*.*.». * . *. **** **. *. ******. . *» .*»..* . * 
TA G-CCED8DDGCVTEFTAANALCYCDKFCDEENSDCCPDYKSFCBEEKEffPPHTQPWYPEG 

HP CMHGGRIYPYLGTYWDNCNRCTCQENRQWQCDQEPCLVDPDMIKAINQGNYGWQAGNHSA 
*...*.* . . . ***. »**... **. *. *. *** *..*...*. *. *** * *. *. 
TA CFKDGQHYEEGSVIKENCNSCTC-SGQQWKCSQHVCLVRPELIEQVNKGDYGWTAQNYSQ 

HP FWGMTLDEGIRYRLGTIRPSSSVMNMHEIYTVLNPGEVLPTAFEASEKWPNLIHEPLDQG 
******. . *. . . ****. . ** ...*.*. . * . . . **. * ** . *. **** 
TAFWGMTLEDGFKFRLGTLPPSLMLLSMNEMTASLPATTDLPEFFYASYKWPGffTHGPLDQK 

HP NCAGSWAFSTAAVASDRVSIHSLGirrPYLSPQNLLSCDTHQQQGCRGGRLDGAWWFLRR 
**».*♦**»**.*».♦*..*.**. *. ******.** .....**..*..* ***.**. 
TA NCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHGCNSGSIDRAWWYLRK 

HP RGWSDHCYPFSGRERDEAGPAPPCimSRAMGRGKRQATAHCPHSYVMNDIYQVTPYY 
*n. *»**.*****.»*.***. ..»m.** 

TA RGLVSHACYPLF— PQMTNNGCAMASRSDGRGKRHATKPCPNNVEKSNRIYQCSPPY 

HP ELGSNDKEIMBLMBNGPVQALMEVHEDFFIYKGGIYSHTPVSLGRPERYSSHGTHSVKI 
. . inn. *. *. ****** **. *. . . . *. **. **. **. 
TA RYSSNETEIMKBIMQNGPYQAIMQWBDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKL 



wo 01/49728 



PCT/JPOO/09359 



148 

HP TGWGBmPDGRTLKYWTAMSWGPAWGBRGmiVRGVNECDIESFYLGWGRV^^^ 
TA TGWGTLRGAQGQKEKFWI AANSWGKSWGENGYFRILRGYNESDIEKL 
HP GHH 

5 

TAP 

The search of the GenBank using the base sequences 
of the present cDNA has revealed the registration of 
10 sequences that shared a homology of 90% or more (for example. 
Accession No. R48402) among ESTs, However, since they are 
partial sequences, it can not be judged whether or not they 
encode the same protein as the protein of the present 
invention. 

15 <HP10735> (SEQ ID NOS : 126, 136, and 146) 

Determination of the whole base sequence of the 
cDNZV insert of clone HP10735 obtained from cDNA library of 
human umbilical cord blood revealed the structure consisting 
of a 370-bp 5' --untranslated region, a 1431-bp ORF, and a 

20 243-bp 3' -untranslated region. The ORF encodes a protein 
consisting of 476 amino acid residues and there existed ten 
putative transmembrane domains. Figure 46 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
Doolittle method, of the present protein. 

25 The search of the protein database using the amino 
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acid sequence of the present protein revealed that the 
protein was similar to Caenorhabditis elegans tetracycline 
resistance protein-like protein (Accession No. CAA94337) . 
Table 27 shows the comparison between amino acid sequences 
5 - of the human protein of the present invention (HP) and C. 
elegans tetracycline resistance protein--like protein (CP) . 
Therein, the marks of and . represent a gap, an amino 

acid residue identical with that of the protein of the 
present invention, and an amino acid residue similar to that 
10 of the protein of the present invention, respectively • The 
both proteins shared a homology of 51.5% in the intermediate 
region of 196 amino acid residues* 
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Table 27 

HP MGSDTAPFLSQAD1)PDDGPVPGTPGLPGSTGNPKSEEPEVPDQEGL(»I^^ 

CP MVNSQQDYI 

HP IVAVLCYira.LNYMDlffmGVLPDIE(ffFNIGDSSSGLIQTVFISSYMVLAPVF6YLGD 
«. . * , »*. ****** **.**. *******. *. **. . ** ***** 

CP SVTALFVmLNYVDRYTVAGVLTQVQTYYNISDSLGGLIQTVFLISFMVFSPVCGYLGD 

HP RYMjmMCGGIAFfSLVTLGSSFIPGBHFWLLLLrRGLVGVGBASYSTIAPTlIADLPV 
*. *»». . * *. . . * . . **»**. *. . ****. *. ». . **. ******. . **. **. * 

CP RFHKKWIMI IGVGIWLGAYLGSSFVPANHFILFLYLRSFVGIGEASYSNVAPSLISDMFN 

HP ADQRSRMLSIFYFAIPVGSGLGYIAGSKVKDMGDffHffALRVTPGLGWAVLLLPLWRE 
...**.. *************. *.**.»...».*.*..*».. * * ♦ . t 

CP GQKRSTVFMIFYFAIPVGSGLGFIYGSNVATLTGHWQWGIRVSAIAGLIVMIALVLFTYE 

HP PPRGAVERHSDLPPLNPTSWffADLRALARNLIFGLITCLTGVLGVGLGVEISRRLRHSNP 
» ***. . . 

CP PERGAADKAMGESIfl)VVVTTHmEDLVILLKOTLVACTWGYTALVFYSGTLSWWEPT 
HP RADPLVCATGLLGSAPFLFLSLACARGSIVATYIPIPIGETLLSMNffAIYADILLYVVIP 
CP VIQETAffHQGLNDTKDLASTDKDRVALYFGAITTAGGLIGYIFGSESKWLVAGWGPFR 
HP TRRSTAEAFQIYLSHLLGDAGSPYLIGLISDRLRRMWPPSFLSEFRALQFSLMLCAFVGA 
CP RLQTDRAQPLVAGGGALLAAPFLLIGMIFGDKSLVLLYIMIFFGITFMCFNWGLNIDMLT 
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HP LGGAAFLGTAIFIEADimQLHYQGLLHEAGSTDDRIYVPW^^ 
CP lYIHPNRRSTAFSYFVLVSHLFGDASGPYLIGLISMIEHGSTO 

5 The search of the GenBank using the base sequences 

of the present cDNA. has revealed the registration of 
sequences that shared a homology of 90% or more (for example;^ 
Accession No. AA460778) among ESTs. However, since they are 
partial sequences, it can not be judged whether or not they 

10 encode the same protein as the protein of the present 
invention. Furthermore, the search has revealed the 
registration of sequences that shared a homology of 90% or 
more (Accession Wo. K12646) in patent data. However, since 
they are partial sequences, it can not be judged whether or 

15 not they encode the same protein as the protein of the 
present invention . 

<HP10750> (SEQ ID NOS: 127, 137, and 147) 
Determination of the whole base sequence of the 
cDNA insert of clone HP10750 obtained from cDNA library of 

20 human umbilical cord blood revealed the structure consisting 
of a 262--bp 5' -untranslated region, a 1350-bp ORF, and a 
564-bp 3' -untranslated region. The ORF encodes a protein 
consisting of 449 amino acid residues and there existed four 
putative transmembrane domains. Figure 47 depicts the 

25 hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
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Doolittle method, of the present protein. 

The search of the GenBank using the base sequences 
of the present cDNA has revealed the registration of 
sequences that shared a homology of 90% or more {for example, 
5 Accession No. AW304031) among ESTs. However, since they are 
partial sequences, it can not be judged whether or not they 
encode the same protein as the protein of the present 
invention. 

<HP10777> {SEQ IDNOS: 128, 138, and 148) 

10 Determination of the whole base sequence of the 

cDNA insert of clone HP10777 obtained from cDNA library of 
human kidney revealed the structure consisting of a 15-bp 
5' -untranslated region, a 318-bp ORF, and a 1030^bp 3'- 
untranslated region. The ORF encodes a protein consisting of 

15 105 amino acid residues and there existed a putative 
secretory signal at the N-terminus. Figure 48 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
Doolittle method, of the present protein. In vitro 
translation resulted in formation of a translation product 

20 of 14 kDa that was somewhat larger than the molecular weight 
of 11,603 predicted from the ORF. Application of the (-3,-1) 
rule, a method for predicting the cleavage site of the 
secretory signal sequence, allows to expect that the mature 
protein starts from glycine at position 30. 

25 <HP10780> (SEQ ID NOS: 129, 139, and 149) 
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Detemination of the whole base sequence of the 
cDNA insert of clone HP10780 obtained from cDNA library of 
hijman kidney revealed the structure consisting of a 226-bp 
5' -untranslated region, a 246-bp ORF, and a 571-bp 3'- 
5 untranslated region. The ORF encodes a protein consisting of 
81 aiaino acid residues and there existed a putative 
secretory signal at the N-teminus- Figure 49 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
Doolittle method, of the present protein. In vitro 

10 translation resulted in formation of a translation product 
of 10 kDa that was -somewhat larger than the molecular weight 
of 8,533 predicted from the ORF. In this case, the addition 
of a microsome led to the formation of a product of . 6 kDa. 
Application of the (-3,-1) rule, a method for predicting the 

15 cleavage site of the secretory signal sequence, allows to 
expect that the mature protein starts from glycine at 
position 25. 

The search of the GenBank using the base sequences 
of the present cDNA has revealed the registration of 
20 sequences that shared a homology of 90% or more (for example, 
Accession No. AA658245) among ESTs. However, since they are 
partial sequences, it can not be judged whether or not they 
encode the same protein as the protein of the present 
invention . 

25 <HP10795> (SEQ ID NOS: 130, 140, and 150) 
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Determination of the whole base sequence of the 
cDNA. insert of clone HP10795 obtained from cDNA library of 
human kidney revealed the structure consisting of a 356-*bp 
5' -untranslated region, a 1659-bp ORF, and a 420-bp 3'- 
5 untranslated region. The ORF encodes a protein consisting of 
552 amino acid residues and there existed one transmembrane 
domain at the N-terminus. Figure 50 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
Doolittle method, of the present protein. In vitro 

10 translation resulted in formation of a translation product 
of 65 kDa that was almost identical with the molecular 
weight, of 64,280 predicted from the ORF. 

The search of the protein database using the amino 
acid sequence of the present protein revealed that the 

15 protein was similar to hixman UDP-N-acetyl-a-D- 
galactosamine: polypeptide N-acetylgalactosaminyltransf erase 
2 (Accession No» NP_004472) . Table 28 shows the comparison 
between amino acid sequences of the human protein of the 
present invention (HP) and human UDP-N-acetyl-ot-D- 

20 galactosamine : polypeptide N-acetylgalactosaminyltransf erase 
2 (GA) . Therein, the marks of *, and . represent a gap, 
an amino acid residue identical with that of the protein of 
the present invention, and an amino acid residue similar to 
that of the protein of the present invention, respectively. 

25 The both proteins shared a homology of 49.9% in the entire 
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region other than the N- terminal region. 

Table 28 ■ . 
HP MRSLTmVLPVFGVLWITVLLFFWVTKRKLEVPT 

.. 

GAMmSMLCFAFLmGIAYOTSaJGSALAGGAimmDWNEIDPim^ 

HP GPEVQTPKPSDADmWDIffDERRYLNAKmVGDDPmYAFNQRESBRISSMSAIPD 

» * » . *. . » *».»** »** ** ***** 

GA GEEKAQSMETLPPGmWPDFNQMmGTTOSGQDPYMlfflfNQVESBiaMroXAIPD 

HP TRHLRCTLLVYCmPPTSI I ITFHNEARSTLLRTimMTPTHLIREI ILVDDFSND 
**» . * . . »**. **. . *********, ****. *♦*. . . **♦. *♦#****. *** 

GA TEHDQCQRKQWRVDLPATSWITFHNEARSALLRTWSVLKKSPPHLIKEI ILYDDYSND 

HP PDDCKQLIKLPPKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHR 
*. * * **. ». **. **». **** **. . . ********* * . **. »**. * 
GA PBDGALLGKIEKVRVLRNDRREGLMRSRVRGADAAQAmTFLDSHCECNEHEEPLLER 

HP VKEDYTRWCPVIDIINLDTFTYIESmRGGFDWSLHFQWEQLSPEQ-KAKSLDPTEP 
* ** ****. ». »*. **. ». * *......*. *****. * *. *. ..**»..*. . *. . ♦ 

GA VAEDRRWSPIIDVIMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVAP 

HP IRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENPEISFRVffMCGGSLEIYPCSRVGHY 
». **. *******. ** . *. ****** **. *****. ******* *******. ******** 
GA IKTPMIAGGLFVMDKFYFEBLGKYDlffiJDVWGGEEEISFRWQCGGSLEIIPCSRVGHV 
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HP FRKKHPYVFPDGNANniKNTMTAEVWMDEYKQmAAia»FALEIffFGN^ 

»**, * **, *. ***»*****. . **** * * . *. **. . »*♦. 

GA FRKtaiPnFPGGSGTYFABmAAEVlffiEYKNFnMWSAm 

HP LRCQSPKmENIYPELSIPKESSITOIRQRQKCLESQRQNNQETPNLKLSPCAKYKG 

GA LSCKPFKWYLENVYPELRVPDHQDIAFGALQQGTHCLDTLGHFADGWG— VYEC ^H 

HP EDAKSQVffAFTYTQQILQEELCLSVITLFPGAPWLVLCKNGDDRQQWTK-TGSHIEHI 

... *. . **..**... *. **. ..»..*. 

GA NAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLIKLQGCEElTOSRQKffEQIEGNSKLSHV 

HP ASaCLDTDMFGDGTENGKEIWNPCESSLMSQHWDMVSS 

GA GSNLCLDS — ^R~TAKSGGLSVEVCGPAL-SQQMFTLKLQQ 

The search of the GenBank using the base sequences 
of the present cDNA has revealed the registration of 
sequences that shared a homology of 90% or more (for example. 
Accession No. AM 60076) among ESTs. However, since they are 
partial sequences, it can not be judged whether or not they 
encode the same protein as the protein of the present 
invention . 

INDUSTRIAL APPLICABILITY 

The present invention provides human proteins 
25 having hydrophobic domains, DNAs encoding these proteins. 



15 



20 
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expression vectors for these DNAs and eukaryotic cells 
expressing these DNAs. Since all of the proteins of the 
present invention are secreted or exist in the cell membrane, 
they are considered to be proteins controlling the 
5 proliferation and/or the differentiation of the cells. 
Accordingly, the proteins of the present invention can be 
eit^jloyed as pharmaceuticals such as carcinostatic agents 
which act to control the proliferation and/or the 
differentiation of the cells, or as antigens for preparing 

10 antibodies against these proteins. The DNAs of the present 
invention can be utilized as probes for the genetic 
diagnosis and gene sources for the gene therapy. Furthermore, 
the DNAs can be utilized for expressing these proteins in 
large quantities. Cells into which these genes are 

15 introduced to express these proteins can be utilized for 
detection of the corresponding receptors or ligands, 
screening of novel small molecule pharmaceuticals and the 
like. The antibody of the present invention can be utilized 
for the detection, quantification, purification and the like 

20 of the protein of the present invention. 

The present invention also provides genes 
corresponding to the polynucleotide sequences disclosed 
herein. "Corresponding genes" are the regions of the genome 
that are transcribed to produce the mRNAs from which cDNA 

25 polynucleotide sequences are derived and may include 
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contiguous regions of the genome necessary for the regulated 
expression of such genes. Corresponding genes may therefore 
include but are not limited to coding sequences, 5' and ^3' 
untranslated regions, alternatively spliced exons, introns, 
5 promoters, enhancers, and silencer or suppressor elements* 
The corresponding genes can be isolated in accordance with 
knovm methods using the sequence information disclosed 
herein. Such methods include the preparation of probes or 
primers from the disclosed sequence information for 

10 identification and/or amplification of genes in appropriate 
genomic libraries or other sources of genomic materials. An 
"isolated gene" is a gene that has been separated from the 
adjacent coding sequences, if any, present in the genome of 
the organism from which the gene was isolated. 

15 Organisms that have enhanced, reduced, or modified 

expression of the gene{s) corresponding to the 
polynucleotide sequences disclosed herein are provided. The 
desired change in gene expression can be achieved through 
the use of antisense polynucleotides or ribozymes that bind 

20 and/or cleave the mRNA transcribed from the gene (Albert and 
Morris, 1994, Trends Pharmacol. Sci. 15(7): 250-254; 
Lavarosky et al., 1997, Biochem. Mol. Med. 62(1): 11-22; and 
Hampel, 1998, Prog. Nucleic Acid Res. Mol, Biol. 58: 1-39; 
all of which are incorporated by reference herein) . 

25 Transgenic animals that have multiple copies of the gene(s) 



wo 01/49728 



PCT/JPOO/09359 



159 

corresponding to the polynucleotide sequences disclosed 
herein, preferably produced by transformation of cells with 
genetic constructs that are stably maintained within the 
transformed cells and their progeny, are provided. 
5 Transgenic animals that have modified genetic control 
regions that increase or reduce gene expression levels, or 
that change temporal or spatial patterns of gene expression, 
are also provided (see European Patent No. 0 649 464 Bl, 
incorporated by reference herein) . In addition, organisms 

10 are provided in which the gene{s) corresponding to the 
polynucleotide sequences disclosed herein have been 
partially or completely inactivated, through insertion of 
extraneous sequences into the corresponding gene{s) or 
through deletion of all or part of the corresponding gene{s). 

15 Partial or complete gene inactivation can be accomplished 
through insertion, preferably followed by imprecise excision, 
of transposable elements (Plasterk, 1992, Bioessays 14(9): 
629-633; Zwaal et al., 1993, Proc. Natl. Acad. Sci. USA 
90(16): 7431-7435; Clark et al., 1994, Proc. Natl. Acad. Sci. 

20 USA 91(2): 719-722; all of which are incorporated by 
reference herein) , or through homologous recombination, 
preferably detected by positive/negative genetic selection 
strategies (Mansour et al., 1988, Nature 336: 348-352; U.S. 
Patent Nos. 5,464,764; 5,487,992; 5,627,059; 5,631,153; 

25 5,614, 396; 5,616,491; and 5,679,523; all of which are 
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incorporated by reference herein) • These organisms with 
altered gene expression are preferably eukaryotes and more 
preferably are mammals. Such organisms are useful for the 
development of non-human models for the study of disorders 
5 involving the corresponding gene(s), and for the development 
of assay systems for the identification of molecules that 
interact with the protein product (s) of the • corresponding 
gene (s) * 

Where the protein of the present invention is 
10 membrane-bound (e.g., is a receptor), the present invention 
also provides for solxable forms of such protein. In such 
forms part or all of the intracellular and transmembrane 
domains of the protein, are deleted such that the protein is 
fully secreted from the cell in which it is expressed. The 
15 intracellular and transmembrane domains of proteins of the 
invention can be identified in accordance with known 
techniques for determination of such domains from sequence 
information • 

Proteins and protein fragments of the present 
20 invention include proteins with amino acid sequence lengths 
that are at least 25% (more preferably at least 50%, and 
most preferably at least 75%) of the length of a disclosed 
protein and have at least 60% sequence identity (more 
preferably, at least 75% Identity; most preferably at least 
25 90% or 95% identity) with that disclosed protein, where 
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sequence identity is determined by comparing the amino acid 
sequences of the proteins when aligned so as to maxiiaize 
overlap and identity while minimizing sequence gaps. Also 
included in the present invention are proteins and protein 
5 fragments that contain a segment preferably coit^rising 8 or 
more (more preferably 20 or more, most preferably 30 or 
more) contiguous amino acids that shares at least 75% 
sequence identity (more preferably, at least 85% identity; 
most preferably at least 95% identity) with any such segment 

10 of any of the disclosed proteins • 

Species homologs of the disclosed polynucleotides 
and proteins are also provided by the preseiit invention. As 
used herein, a "species homologue" is a protein or 
polynucleotide with a different species of origin from that 

15 of a given protein or polynucleotide, but with significant 
sequence similarity to the given protein or polynucleotide, 
as determined by those of skill in the art. Species homologs 
may be isolated and identified by making suitable probes or 
primers from the sequences provided herein and screening a 

20 suitable nucleic acid source from the desired species* 

The invention also encompasses allelic variants of 
the disclosed polynucleotides or proteins; that is, 
naturally-occurring alternative forms of the isolated 
polynucleotide which also encode proteins which are 

25 identical, homologous, or related to that encoded by the 
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polynucleotides • 

The invention also includes polynucleotides with 
sequences • complementary to those of the polynucleotides 
disclosed herein. 

The present invention also includes 
polynucleotides capable of hybridizing under reduced 
stringency conditions, more preferably stringent conditions, 
and most preferably highly stringent conditions, to 
polynucleotides described herein. Examples of stringency 
conditions are shown in the table below: highly stringent 
conditions are those that are at least as stringent as, for 
example, conditions A-F; stringent conditions are at least 
as stringent as, for example, conditions G-L; and reduced 
stringency conditions are at least as stringent as, for 
example, conditions M-R. 
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Table 29 



Stringency 
Condi t ion 


Poly- 
Hybrid 


Hybrid 
(bp)* 


Hybridization Temperature 
and Bu f f e r * 


Wash 

Tenqpe r atur e 
and Buf fer*^ 


A 


DNA : WA 


>50 


65 °C; IxSSC -or- 
42°C; IxSSC, 50% 
formamide 


65 °C; 
0.3XSSC 


B 


DNA : DNA 


<50 


Tb*; IxSSC 


Tb*; ixssc 


C 


DNA : RNA 


^50 


67**C; IxSSC -or- 
45^*0; IxSSC, 50% 
formamide 


67**C; 
0-3xSSC 


D 


DNA : RNA 


<50 


Td*; IxSSC 


Tp*; IxSSC 


E 


RNA : RNA 


>50 


70*C; IxSSC -or- 
50*^0; IxSSC, 50% 
formamide 


70°C; 
O-SxSSC 


F 


RNA : RNA 


<50 


Tf*; IxSSC 


Tp*; IxSSC 


G 


DNA : DNA 


>50 


65 °C; 4xSSC -or- 
42*C; 4xSSC,50% 
formamide 


65°C; IxSSC 


H 


DNA : DNA 


<50 


T„*; 4xSSC 


T„*; 4xSSC 


I 


DNA : RNA 


^50 


67**C; 4xSSC -or- 
45*C; 4xSSC,50% 
formamide 


67**C; IxSSC 


J 


DNA : RNA 


<50 


Tj*; 4xSSC 


Tj*; 4xSSC 


K 


RNA : *RNA 


>50 


70**C; 4xSSC -or- 
50**C; 4xSSC,50% 
formamide 


67**C; IxSSC 


L 


RNA : RNA 


<50 


Tl*; 2xSSC 


Tj.*; 2xSSC 


M 


DNA : DNA 


^50 


50*C; 4xSSC -or- 
40**C; 6xSSC,50% 
formamide 


50°C; 2xSSC 


N 


DNA: DNA 


<50 


Th*; 6xSSC 


Tn*; 6xSSC 


0 


DNA : RNA 


^50 


55*'C; 4xSSC -or- 
42*C; 6xSSC,50% 
formamide 


55"C; 2xSSC 


P 


DNA : RNA 


<50 


Tp*; 6xSSC 


Tp*; 6xSSC 


Q 


RNA : RNA 


>50 


60"C; 4xSSC -or- 
45**C; 6xSSC,50% 
formamide 


60**C; 2xSSC 


R 


RNA : RNA 


<50 


Tr*^; 4xSSC 


Tr*; 4xSSC 
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+ : The hybrid length is that anticipated for the 
hybridized region (s) of the hybridizing polynucleotides. 
When hybridizing a polynucleotide to a target polynucleotide 
of unknown sequence^ the hybrid length is assumed to be that 
5 of the hybridizing polynucleotide. When polynucleotides of 
known sequence are hybridized, the hybrid length can be 
determined by aligning the sequences of the polynucleotides 
and identifying the region or regions of optimal sequence 
complementarity. 

10 t : SSPE (IxSSPE is O.lSMNaCl, lOiriM NaH2P04/ and 1.25itiM 

EDTA, pH7.4) can be substituted for SSC (I^SSC is 0.15M NaCl 
and 15mM sodium citrate) in the hybridization and wash 
buffers; washes are performed for 15 minutes after 
hybridization is complete. 

15 *Tb - Tr : The hybridization temperature for hybrids 

anticipated to be less than 50 base pairs in length should 
be S-lC'c less than the melting temperature {TJ of the 
hybrid, where T„ is determined according to the following 
equations. For hybrids less than 18 base pairs in length, 

20 T„("C)=2{#of A + T bases) + 4(# of G + C bases) . For hybrids 
between 18 and 49 base pairs in length, 1^^*0=^81.5 + 
16.6(logio[Na''] ) +0.41 {%G+C) - (600/N), where N is the 
number of bases in the hybrid, and [Na*] is the concentration 
of sodiiom ions in the hybridization buffer ( [Na*] for 

25 lxSSC=0.165M) . 
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Additional examples of stringency conditions for 
polynucleotide hybridization are provided in Sambrook, J,, 
E.F. Fritsch, and Maniatis, 1989, Molecular Cloning: A 
5 Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, NY, chapters 9 and 11, and Current Protocols 
in Molecular Biology, 1995, F-M. Ausubel et al., eds., John 
Wiley & Sons, Inc., sections 2-10 and 6.3-6.4, incorporated 
herein by reference • 

10 Preferably, each such hybridizing polynucleotide 

has a length that is at least 25% (more preferably at least 
50%, and most preferably at least 75%) of the length of the 
polynucleotide of the present invention to which it 
hybridizes, and has at least 60% sequence identity (more 

15 preferably, at least 75% identity; most preferably at least 
90% or 95% identity) with the polynucleotide of the present 
invention to which it hybridizes, where sequence identity is 
determined by comparing the sequences of the hybridizing 
polynucleotides when aligned so as to maximize overlap and 

20 identity while minimizing sequence gaps. 
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CLAIMS 

1. A protein comprising any one of amino acid 
sequences selected from the group consisting of SEQ ID NOS: 

5 1 to 10, 31 to 40, 61 to 70, 91 to 100 and 121 to 130. 

2. An isolated DNA encoding the protein according to 
Claim 1. 

3. An isolated cDNA comprising any one of base 
sequences selected from the group consisting of SEQ ID NOS: 

10 11 to 20, 41 to 50, 71 to 80, 101 to 110 and 131 to 140. 

4. The cDNA according to Claim 3 consisting of any 
one of base sequences selected from the group consisting of 
SEQ ID NOS: 21 to 30, 51 to 60, 81 to 90, 111 to 120 and 141 
to 150. 

15 5. An expression vector that is capable of expressing 

the DNA according to any one of Claim 2 to Claim 4 by in 
vitro translation or in eukaryotic cells. 

6. A transformed eukaryotic cell that is capable of 
expressing the DNA according to any one of Claim 2 to Claim 

20 4 and of producing the protein according to Claim 1. 

7. An antibody directed to the protein according to 
Claim 1. 
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115 120 125 

Leu Arg Val His Leu Asp lie Gin Val Gly Glu His Ala Asn Asn Tyr 

130 135 140 

Pro Glu lie Ala Ala Lys Asp Lys Leu Thr Glu Leu Gin Leu Arg Ala 
5 145 150 155 160 

Arg Gin Leu Leu Asp Gin Val Glu Gin lie Gin Lys Glu Gin Asp Tyr 

165 170 175 

Gin Arg Tyr Arg Glu Glu Arg Phe Arg Leu Thr Ser Glu Ser Thr Asn 
180 185 190 

10 Gin Arg Val Leu Trp Trp Ser lie Ala Gin Thr Val He Leu He Leu 
195 200 205 

Thr Gly He Trp Gin Met Arg His Leu Lys Ser Phe Phe Glu Ala Lys 

210 215 220 

Lys Leu Val 
15 225 



<210> 92 
<211> 352 
<212> PRT 
20 <213> Homo sapiens 



<400> 92 

Met Glu Ser Gly Gly Arg Pro Ser Leu Cys Gin Phe He Leu Leu Gly 

1 5 10 15 

25 Thr Thr Ser Val Val Thr Ala Ala Leu Tyr Ser Val Tyr Arg Gin Lys 
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20 25 30 

Ala Arg Val Ser Gin Glu Leu Lys Gly Ala Lys Lys Val His Leu Gly 

35 40 45 

Glu Asp Leu Lys Ser He Leu Ser Glu Ala Pro Gly Lys Gys Val Pro 
5 50 55 60 

Tyr Ala Val He Glu Gly Ala Val Arg Ser Val Lys Glu Thr Leu Asn 
65 70 75 80 

Ser Gin Phe Val Glu Asn Cys Lys Gly Val He Gin Arg Leu Thr Leu 
85 90 95 

10 Gin Glu His Lys Met Val Trp Asn Arg Thr Thr His Leu Trp Asn Asp 
100 105 110 

Cys Ser Lys He He His Gin Arg Thr Asn Thr Val Pro Phe Asp Leu 

115 120 125 

Val Pro His Glu Asp Gly Val Asp Val Ala Val Arg Val Leu Lys Pro 
15 130 135 140 

Leu Asp Ser Val Asp Leu Gly Leu Glu Thr Val Tyr Glu Lys Phe His 
145 150 155 160 

Pro Ser He Gin Ser Phe Thr Asp Val He Gly His Tyr He Ser Gly 
165 170 175 

20 Glu Arg Pro Lys Gly He Gin Glu Thr Glu Glu Met Leu Lys Val Gly 
180 185 190 

Ala Thr Leu Thr Gly Val Gly Glu Leu Val Leu Asp Asn Asn Ser Val 

195 200 205 

Arg Leu Gin Pro Pro Lys Gin Gly Met Gin Tyr Tyr Leu Ser Ser Gin 
25 210 215 220 
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Asp Phe Asp Ser Leu Leu Gin Arg Gin Glu Ser Ser Val Arg Leu Trp 
225 230 235 240 

Lys Val Leu Ala Leu Val Phe Gly Phe Ala Thr Cys Ala Thr Leu Phe 
245 250 255 

5 Phe lie Leu Arg Lys Gin Tyr Leu Gin Arg Gin Glu Arg Leu Arg Leu 
260 265 270 

Lys Gin Met Gin Glu Glu Phe Gin Glu His Glu Ala Gin Leu Leu Ser 

275 280 285 

Arg Ala Lys Pro Glu Asp Arg Glu Ser Leu Lys Ser Ala Cys Val Val 
10 290 295 300 

Cys Leu Ser Ser Phe Lys Ser Cys Val Phe Leu Glu Cys Gly His Val 
305 310 315 320 

Cys Ser Cys Thr Glu Cys Tyr Arg Ala Leu Pro Glu Pro Lys Lys Cys 
325 330 335 

15 Pro He Cys Arg Gin Ala He Thr Arg Val He Pro Leu Tyr Asn Ser 
340 345 350 



<210> 93 
20 <211> 130 
<212> PRT 

<213> Homo sapiens 
<400> 93 

25 Met Ser Ser Ser Gly Gly Ala Pro Gly Ala Ser Ala Ser Ser Ala Pro 



